Title of the Invention 

INHIBITING THE GROWTH OF BACTERIAL BIOFILMS 
Cross-Reference to Related Applications 

The present invention claims the benefit of U.S. provisional application no. 
5 60/464,333 filed April 22, 2003 and U.S. provisional application no. 60/517,391 filed 
November 6, 2003, the entire contents of both applications are incorporated herein by 
reference. 

BACKGROUND OF THE INVENTION 
Field of the Invention 

10 The present invention relates to the inhibition or reduction in bacterial biofilm 

growth and development. 
Discussion of the Background 

Surface attached, matrix-enclosed communities, called biofilms, cause serious 
economic and health problems due to biofilm-associated phenotypes such as antibiotic 

15 resistance or biofouling Costerton, J. W., Stewart, P. S. & Greenberg, E. P. (1999) 
Science 284, 1318-22. The inherent resistance to antimicrobial agents are the root of 
many persistent and chronic bacterial infections as nosocomial infections and legionaire's 
disease. The drastic phenotypic changes seen in biofilms led to the assumption that the 
physiological modifications necessary for planktonic bacteria to adopt the biofilm 

20 lifestyle must involve specific responses. However, biofilm physiology is still poorly 
understood and, whereas the early events of biofilm formation are well documented, little 
is known about the nature of the physiological changes and critical regulatory processes 
occurring inside mature biofilms. Global expression profiling comparing protein 
synthesis in Pseudomonas planktonic and biofilm bacteria suggested that a large number 

25 of genes could be differentially regulated during biofilm development (Sauer, K., Camper, 
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A. K., Ehrlich, G. D., Costerton, J. W. & Davies, D. G. (2002) J Bacteriol 184, 1140-54; 
Sauer, K. & Camper, A. K. (2001) J Bacteriol 183, 6579-89; Whiteley, M., Bangera, M. 
G., Bumgamer, R. E., Parsek, M. R., Teitzel, G. M., Lory, S. & Greenberg, E. P. (2001) 
Nature 413, 860-4). Although these pioneering studies opened the way to the genetic 
5 characterization of the biofilm phenotype, extracting functional information from 
genomic approaches remains a challenge. 

Escherichia coli K12, a widely used bacterial model, does not spontaneously form 
extensive biofilms. However, it has been previously shown that expression of pili from 
conjugative plasmids, which are widespread in natural bacterial populations, promotes 
1 0 the development of mature biofilms (Ghigo, J. M. (2001) Nature 412, 442-5). This raised 
the possibility of studying the genetic basis of the biofilm phenotype in E. coli K12 where 
expression profiling can be combined with the phenotypic analysis of a large set of 
deletion mutants. 

In view of the above, there remains an urgent need to develop new strategies for 
15 combating the development of mature biofilms Based on the discovery of the genes 
involved in the development of mature biofilms, the present invention provides targets to 
disrupt the development, formation and/or maturation of bacterial biofilms, and 
molecular tools to characterize and detect mature biofilms. 

20 SUMMARY OF THE INVENTION 

Thus, the present invention is based on the discovery of the unique expression of 
genes during the formation of bacterial biofilms thereby providing a target to reduce, 
ameliorate, attenuate, inhibit and/or treat biofilms. 

Accordingly, one aspect of the present invention is to a method of treating, 
25 reducing, ameliorating, attenuating and/or inhibiting the formation of biofilms by 



targeting the specific genes that are involved in the formation of the biofilm. These 
methods can be accomplished by contacting an already formed biofilm and/or a sample, 
surface or other substrate that may be susceptible to biofilm formation with one or more 
inhibitors of those genes. 
5 In another aspect of the present invention, methods of screening for substances 

that inhibit the genes involved in biofilm formation is also provided. 

In another aspect of the present invention, a polynucleotide library which is useful 
for molecular characterization of a mature bacterial biofilms is also provided. 

In another aspect of the present invention, using the libraries to detect mature 
10 bacterial biofilms is also provided. 

BRIEF DESCRIPTION OF THE DRAWINGS 
A more complete appreciation of the invention and many of the attendant 
advantages thereof will be readily obtained as the same becomes better understood by 
reference to the following detailed description when considered in connection with the 
1 5 accompanying drawings, wherein: 

Figure 1: Function of genes over-expressed in TGI biofilm versus exponential 
growth phase 

This figure summarizes the data presented in Table 3. The genes have been classified 
according to the COGs functional categories annotation system. Large and medium size 
20 numbers indicate the total number of E. coli biofilm-induced genes into each class or 
sub-class of indicated functions. Genes are indicated only when their expression level in 
biofilm differed by at least a two-fold factor (>2). Numbers within brackets indicate the 
rank as over-expressed genes; 1 = most expressed gene in TGI E. coli biofilm. 

25 Figure 2: Correlation of macroarray and quantitative real-time PCR results 
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The calculated macroarray and Q-RT-PCR ratios of the expression of 7 genes in TGI 
biofilm relative to exponential growth phase were log transformed, and values were 
plotted against each other to evaluate their correlation. The correlation coefficient was 
deduced from a linear regression of the plotted values. 

5 

Figure 3: Biofilm phenotype of selected deletion mutants 

Mature biofilm development of E. coli TGI (wt) compared with a selection of deletion 
mutants of genes over-expressed in TGI biofilm. 

A: For each mutant phenotype analysis, the extent of biofilm formation is shown in the 
10 bottom part of the micro-fermenter and on the removable glass slide. A typical 
experiment is shown. 

B: Graphical comparison of biofilm formation relative to wild type from the mutants 
presented in A. Data represents the average of three independent experiments for each 
mutant. The level of biofilm formed by wt TGI biofilm was set to 100%. 

15 

Figure 4: Functional profiling of E. coli biofilm: flow chamber analysis 

A. Spatial distribution of biofilm formation for E. coli TGI and selected TGI deletion 
mutants expressing Gfp. Biofilms were grown in flow chambers. Biofilm development 

20 was monitored by SCLM at the indicated times after inoculation (20 h, 45 h, 70 h, 95 h). 
Micrographs represent simulated three-dimensional images. Images inseted into 70 h and 
95 h of ycfJ correspond to rare area where the biofilm was more developed. 

B. COMSTAT analysis of biofilm structures. Diagrams and standard deviations (numbers 
indicated in the individual columns) of biomass and substrate coverage from biofilms of 

25 E. coli TGI and TGI deletion mutants were determined by the COMSTAT program at 
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four different time points (20 h, 45 h, 70 h, 95 h). Values are means of data from 12 image 
stacks (6 image stacks from two independent channels). The biomass is in the unit 
\im 3 /\im 2 . The substratum coverage values are relative (1 represents total coverage). 

Figure 5: A comparison of biofilm formation capacity of mutants in the E. coli cpx 
and rpoE envelope stress pathways 

Biofilm development comparison of TGI and TGI deletion mutants in micro-fermenters. 
The average of at least four experiments was plotted in the histogram. The level of 
biofilm formed by wt TGI biofilm was set to 100%. 

Figure 6: Comparison of TGI and TGI AcpxP biofilm structure 

Phenotypic analysis of the structure of TGI and TGI AcpxP biofilms grown in 
micro-fermenter. 

A: General view of the bottom part of the fermenter. 

B: Macroscopic biofilm grown on the internal glass slide, removed from the fermenter 
shown in panel A. 

C: Close-up on the biofilm shown in panel B. 

D: transverse section of TGI and TGI A cpxP biofilm. 

E and F: detailed X50 and X 10000 electron micrographs of TGI and TGI cpxP biofilm 
structure. 

Figure 7: COG functional classes for genes under-expressed in TGI biofilm versus 
exponential growth phase. 

This figure summarizes the data presented in Table 4. The genes have been classified 
according to the COGs functional categories annotation system. Large and medium size 
numbers indicate the total number of E. coli genes falling into each class or sub-class of 



function. Genes are indicated only when their expression level in biofilm differed by at 
least a two-fold factor (< 0.5). Numbers within brackets indicate the rank as 
under-expressed genes; 1 = most repressed gene in TGI biofilm. 

5 Figure 8: Functional profiling of mature E. coli biofilm: biofilm formation in 
microfermenters. 

Comparison of mature biofilm development in micro-fermenters of wild type E. coli TGI 
with TGI mutants in the genes found to be induced by over a two-fold factor in TGI 
biofilm. This figure complements the Figure 3. The far right of the panel describes the 
10 analysis of biofilm development of a pspF mutant, a constitutively expressed positive 
regulator of the pspABCDE operon. The data represent the average of at least three 
independent experiments for each mutant. Wild type TGI biofilm formation was set to 
100. 

1 5 Figure 9: Functional profiling of early steps in E. coli biofilm formation 

Comparison of the early adhesion ability of TGI mutants in genes identified as 
over-expressed in mature TGI versus exponential growth phase or analyzed in this study 
as visualized by crystal violet staining in a static microtiter plate-based assay. E. coli TG 
(M63B1 glucose medium supplemented with proline) adheres poorly in this assay. TGI 
20 fimA (boxed) displays an expected reduced early adhesion capacity. Stars (*) correspond 
to TGI mutants with a growth impairment leading to a non meaningful reduction of 
adhesion in this early biofilm assay. 

25 DETAILED DESCRIPTION OF THE INVENTION 
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The formation of biofilms results in a major lifestyle switch that is thought to 
affect the expression of multiple genes and operons. Using DNA arrays to study the 
global effect of biofilm formation on gene expression, the inventors have demonstrated 
that in biofilms, 1 .9% of the genes showed a consistent up or down-regulation by a factor 
5 greater than two, and that 10% of the E. Coli genome is significantly differentially 

expressed including genes of unknown function, stress-response genes as well as energy 
production and envelope biogenesis functions. The inventors provide evidence that the 
expression of stress envelope response genes, such as the psp operonor elements of the 
cpx pathway, is a general feature of E. coli biofilms. Using gene disruption of 53 of the 
10 genes showed that 17 of the genes are required for the formation of mature biofilm. This 
includes 1 1 genes of previously unknown function. 

Thus, the genes involved in biofilm formation and useful as targets for identifying 
substances that inhibit biofilm formation are those described herein, for example, 
including IctR, recA, mdh, rbsB, rnsrA,finA, tatE, pspF, cpxP, spy, ycfj, ycfR, yoaB, yqcQ 
1 5 yggN, ymcA, yccA, yfcx, yghO, yceP, and ycuB. Preferably, the genes involved in biofilms 
formation are one or more of yccA, (SwissProt accession number-P06967; GenBank 
number- gl787205, the amino acid sequence is shown as SEQ ID NO:299 and the 
nucleotide sequence encoding the protein is shown in SEQ ID NO:300), ycfT, (SwissProt 
accession number- P37796;GenBank number-gl 787353, the amino acid sequence is 
20 shown as SEQ ED NO:301 and the nucleotide sequence encoding the protein is shown in 
SEQ ID NO:302), and yceP, (SwissProt accession number-P75927;GenBank 
number-gl 787299, the amino acid sequence is shown as SEQ ID NO:303 and the 
nucleotide sequence encoding the protein is shown in SEQ ID NO:304). 

As used herein, the term "polynucleotide" refers to a polymer of RNA or DNA 
25 that is single-stranded, optionally containing synthetic, non-natural or altered nucleotide 



bases. A polynucleotide in the form of a polymer of DNA may be comprises of one or 
more segments of cDNA, genomic DNA or synthetic DNA. 

The term "subsequence" refers to a sequence of nucleic acids that comprise a part 
of a longer sequence of nucleic acids. 
5 In a further embodiment of the invention, the proteins are at least 70%, preferably 

at least 80%, more preferably at least 90% identical to the sequences identified above. In 
another embodiment, the genes and thus gene products that are to be inhibited are 
encoded by polynucleotide sequence with at least 70%, preferably 80%, more preferably 
at least 90%, 95%, and 97% identity to the sequences described above, these 
10 polynucleotides will hybridize under stringent conditions to the coding or non-coding 
polynucleotide sequence above. Preferably, these homologous sequences would have the 
same or similar activity to the sequences specifically identified above. 

The terms "stringent conditions" or "stringent hybridization conditions" includes 
reference to conditions under which a polynucleotide will hybridize to its target sequence, 
15 to a detectably greater degree than other sequences (e.g., at least 2-fold over background). 
Stringent conditions will be those where hybridization in 50% formamide, 1 M NaCl, 1% 
SDS at 37°C, and a wash in 0.1X SSC at 60 to 65°C (see Tijssen, Laboratory Techniques 
in Biochemistry and Molecular Biology-Hybridization with Nucleic Acid Probes, Part I, 
Chapter 2 "Overview of principles of hybridization and the strategy of nucleic acid probe 
20 assays", Elsevier, New York (1993); and Current Protocols in Molecular Biology, 
Chapter 2, Ausubel, et al„ Eds., Greene Publishing and Wiley-Interscience, New York 
(1995)). Amino acid and polynucleotide identity, homology and/or similarity can be 
determined using the BLAST algorithm. Preferably, these homologous sequences would 
have the same or similar activity to the sequences specifically identified above. 



8 



In one embodiment, the present invention provides methods of reducing, 
inhibiting, ameliorating, and/or treating bacterial biofilms, such as E. coli biofilms, by 
inhibiting, reducing, and/or attenuating the genes and/or gene products, e.g, messenger 
RNA and proteins encoded thereby, described herein as being involved in biofilm 
5 formation. 

By "treating" is meant the slowing, interrupting, arresting or stopping of the 
progression of the biofilm growth and does not necessarily require the complete 
elimination of the biofilm. "Preventing" or "ameliorating" is intended to include the 
prophylaxis of the biofilm development and/or growth, wherein "prophylaxis" is 
10 understood to be any degree of inhibition on the biofilm development and/or growth, 
including, but not limited to, the complete prevention of biofilm development and/or 
growth. The substances which inhibit the gene(s) described herein are collectively 
termed "biofilm inhibitor(s)." In one embodiment, the biofilm inhibitor(s) decrease the 
ability of the biofilm to develop and/or mature at least by 1%. In another embodiment, 
15 the decrease is at least by 5%, 10%, 1 5%, 20%, 30%, 35%, 40%, etc. 

To effectuate the inhibition of biofilms, a surface, and/or sample (collectively 
termed "at least one substrate") on which a biofilm has begun to develop can be contacted 
with one or more of the biofilm inhibitors thereby inhibiting the biofilm formation. In an 
alternative embodiment, the at least one substrate on which a biofilm has already formed 
20 or developed can be contacted with one or more of the biofilm inhibitors such that biofilm 
becomes less prevalent or completely disappears from the substrate. In an alternative 
embodiment, the at least one substrate in which a biofilm has not begun to develop but is 
susceptible to biofilm formation can be pretreated with one or more of the biofilm 
inhibitors to inhibit the formation of the biofilm on the at least one substrate. The 
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substrate as used herein refers to any surface, liquid or solid, on which a biofilm develops, 
has developed, or is susceptible to biofilm formation. 

The biofilm inhibitors can be any substance, chemical, and/or biological materials 
that inhibit the development and/or formation of the biofilm in an appreciable manner as 
5 described herein. For example, antibodies that specifically bind to and inhibit the activity 
of proteins that are encoded by the genes described herein can be used to inhibit the 
development and/or formation of the biofilm. Polyclonal, monoclonal and/or fragments 
(e.g., Fab fragments) of antibodies that specifically bind to the proteins of the genes 
described herein may be used so long as they inhibit the function of the gene products 
10 according to the disclosure herein. Obtaining polyclonal, monoclonal and/or functional 
fragments thereof is conventional and is described, for example, in Harlow and Lane 
"Using Antibodies: A Laboratory Manual" © Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, New York (1999). 

An effective amount of the inhibitors as described herein can be used either 
1 5 singularly or in combination and should be used in an amount that results in some 

inhibition of biofilm development and/or growth When the inhibitors are administered in 
combination, they may be premixed, administered simultaneously, or administered singly 
in series. 

In another aspect of the present invention, methods to identify substances, agents, 
20 compounds and/or chemicals (collectively termed "inhibitors") that reduce, inhibit, 

ameliorate and/or treat the development of and/or formation of biofilms. Such methods 
are preferably accomplished by targeting one or more of the genes involved in biofilm 
development as described herein. In one embodiment of this aspect, the gene (or genes) 
are expressed in host cell, preferably a bacterial cell such as E. coli, and the ability of the 
25 inhibitor to affect the gene and/or protein are assessed. For example, levels of 
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transcript can be measured using convention* DNA and/or RNA probing techniques, 
sucb as PCR and other hybridization assays. Thus, the ecu expressing the one or more 
protein encoded by .he genes described herein is contacted with the inhibitor and me 
reiative >eve. of transcription is measured in relation to the cen before contacting with the 
5 inhibitor; and/or compared to a ce„ which simdariy expresses the pro.ein(s, and which 
was no, contacted with the inhibitor. In a similar manner, ieveis of protein expressed m 
ce,, can be assessed, comparing contacted and uncounted ce„s, using protein anaiyhca! 
techniques known in the art. 

Screening for the inhibitors can also be accomplished by testing the effects of Ore 
,0 inhibit) on the deveiopmen, and/or growth of the biofilm as described herein. Once 
identified, the inhibitors can be used to reduce, inhibit, ameiiorate and/or treat the 
development of and/or formation of bioftlms as described herein. 

The inhibitors may be formulated or combined with any acceptable carrier, such 
as buffered saline or other buffered solution. 

„ another aspect of the invention, a polyrtuc.eotide library is provided that is 
nsefit, in the mo,ecu.ar characterization of a mature bacteria, bio fll m, which comprises a 
poo! of polynucleotide sequences or subsequences thereof wherein said sequences or 
subsequences are overexpressed in mature bacteria, biofilm, 
sequences or subsequences may be immobilized on a so,id support in order to form a 

ionic interaction, hydrophobic interaction or otherwise. The so.id support can be a ny.on 
ntembrane.g^ssHde.glassheads.ancUorasiHconchip.Thus.inanotherembodiment.a 
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polynucleotide array is provided which is usefir, «o detect a mature bacteria, biofihn and 
which comprises an immobilized polynucleotide library as described above. 
T.e immobilized polynucleotide tibrary and array can be nsed for delecting differentially 
expressed polynucleotide sequences which are specific* corre,a,ed with a mature 
5 bacteria, biofilms. In Ibis method a polynucleotide sample ia obtained, and .abeled by 
reacting .he polynucleotide sample win, a labeled probe immobilized on a solid support 
wherein said probe comprises any of ,he po,ynuc,eo,ide sequences of tire polynucleotide 

sequences; and deeding a polynucleotide sample reaction product The method can be 
,0 used for de.ee.ing maurre bacteria, biofilms, such as, an EscUericHia coH biofilm. 

„ anolher embodimen. of tire method, a con.ro, polynuc.eotide sample, which is 
1 abe,ed,isemp 1 oyedforeomparing. h camoun.ofpo,ynue,eotidesan 1 p.ereac.ionproduc, 

t0 th e amoun. of tire con.ro, sample reaction product. In another embodiment of tire 
method, 'KNA or mRNA is iso.ated from the polynuc.eotide sample, and which maybe 
15 reverse transcribed to yield a cDNA molecule. 

Th e labeling reaction can be performed by hybridizing the polynucleotide sample 
with *. .abeleti probe. Tne labe, can be radioactive, colorimetric, enzymatic, molecular 
^hfieation.bioluminescentorfluorescent. Detection can then be performed as lotown 



in the art 
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h anolher embodiment, where tire ptoduc, encoder! by any of the polynucleotide 
fences or subsequences is emp.oyed, .he detection can be based on a reeep.or-ligand 



reaction. 



„ anotiter aspee, of the present invention, a method of detecting signiftcant.y 
„essed genes correlated with a mahtre bacteria, bioftims can be performed. As 
25 used herein, "significant^ overexposed" means that the gene or express.on product 



detected is expressed in by a factor of 2 or greater compared to a bacterial cell which is 
not in a biofilm or begun to develop biofilms characteristics. This method comprises 
detecting at least one polynucleotide sequence or subsequence of a polynucleotide library 
as described above or detecting at least one product encoded by said polynucleotide 
5 library in a sample obtained from a patient. In another embodiment of this method, an 
amount of the at least one polynucleotide sequence or subsequence or product encoded by 
said polynucleotide sequence is compared with an amount of the polynucleotide sequence 
or subsequence or product encoded by said polynucleotide sequence or subsequence 
obtained from a control sample. Extracted mRNA may also be used, which can be 
10 reverse transcribed into a cDNA molecule. In another embodiment of this method, the at 
least one polynucleotide sequence or subsequence can be hybridized with mRNA or 
cDNA from the polynucleotide sample using, for example, the labeling and detection 
described above. In another embodiment, of this method where the product encoded by 
any of the polynucleotide sequences or subsequences is employed, the detection can be 
1 5 based on a receptor-ligand reaction. 

Preferably, the sequences or subsequences correspond substantially to the 
polynucleotide sequences of the following genes : me, IctR, dinl, glpQ, mdh, sixA, lamB, 
rbsB, gadA, pspA, pspB, pspC, pspD, tatE. cpxP, rseA, rpoE, spy, yebE, yqcC, yfcX, yjbO, 
yceP, sn&ygiB. In another embodiment, the library further comprises polynucleotide 
20 sequences or subsequences thereof of the following genes: recA, msrA, fimA, pspF, ycfJ, 
ycfR, yoaB, yggN, yneA, yccA, yghO. In another embodiment, the library further 
comprises polynucleotide sequences or subsequences thereof of the following genes: 
RplY, recA, cyoD, sucA.fdhF, cyoC, nifU, sucD, sfsA, nifS.fadB, ucpA.ftsL, sulA, eco, 
msrA, P spD,fimA,fimI, pspE, pspF, cutC, sodC, rseB, ycfJ, ycfR, yoaB, yhhY, yggN, yneA, 
25 ybeD, ydcl, yddL, yccA, yrdD, ybjF, yihN, 1228, yc/L, yiaH, yqeC. 
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In another embodiment, the library further comprises polynucleotide sequences 
or subsequences thereof of the following genes: lysU, miaA, rluC, rplY, crl, cspD, dniR, 
fruR, idnR, lad, nac, rnk, rpoS, ttk, b0299, dinG, dinP, exo, intA, recA, recN, sbmC, xthA, 
aceA, aceB, aldA, atpA, cyoA, cyoC, cyoD, dctA, fdhF, fdoG, glpD, glpK, ni/U, pckA, 
sdhB, sdhD, sucA, sucB, sucD, xdhD, agp, gcd, glgS, glpX, malE, malF, malS, mglA, 
mglB, mrsA, pgm, rbsC, rbsD, sfsA, ansB, argC, argR, idnD, leuD, metH, nifS, putP, 
metK.pnuC, ubiE,fabA,fadB,fadE,fadL,pgpA,pssA, uppS, idnO, ucpA.ftsL, sulA, dnaJ, 
dnaK, eco.fkpA, glnE, htpG, htpX, msrA, amiB, ddg, fhiA, fimA, fiml, htrL. lepB, mraW, 
nlpB, nlpC, ompC, ompG, pspE, pspF, chaA, chaC, cutC, cysP, cysU.fur, modA, modB, 
modC, modE, sodC, trkH, rseB, ycfJ, ycfR, yoaB, yhhY, yggN, yneA, ybeD, ydcl, yddL, 
yccA, yrdD, ybjF, yihN, ycfT, yeeF, yfiE, yeeD, yliH, yfcM, ybiX, yfhF/nifA, ygfQ, ybhR, 
ybdH, yihR, ydcT, ygiS, ybaZ, ydaM, tfaR, yceL, yheT, yjdC, ybiW, ybiF, ynal, yceE, yhdP, 
ygjE, csiE, yfdE, yeeE, yegQ, glcA, yfdW, yfeT, ygjK, ydeW, M228, ycjL, yghO, yiaH, 
yqeC ycJT, yhjJ, yceB, ybiX,ygiQ, yagV, yoeA, ybhQ, ybcI, ybbF. ybgl, yncH, yjbM, yjiM, 
yjfO, ychN, ynaC, ymJE, yfcN, yrbC, yfdQ, yfeY, ygiM, yhgA, yhjQ, yfcF, yfel, yjiD, yjbP, 
yphB, yfbN, ylbH, ybhM, yrbL, yjfY, ynfA, yajl, yedl, yafZ, yjjU, yfliH, yqfN, yrbE, yfgC, 
yfjQ. ycaK, yfeS, b4250, ybgA, yeeA, ypfl, b2394, yegK, ybcJ, yhiN, ypfG, ydiY, yjjJ, ycaP, 

yfgJ- 

In a preferred embodiment of the library, the biofilms is a Escherichia coli 
biofilms. 
EXAMPLES 

Experimental procedures 
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Bacterial strains and culture conditions 

Bacterial strains used in this work are described in Table 2. All experiments were 
performed in 0.4% glucose M63B1 minimal medium at 37°C except flow chamber 
experiments that were performed at 30°C in 0,02% glucose FAB minimal medium. 
Proline was added at 400 ug/ml for TG growth. 



Early adhesion and biofilm formation assay 

Microtiter plate assays were performed as described in (OToole, G. A., and Kolter, R. 
(1998) Mol Microbiol 30: 295-304). Biofilm development comparisons in aerated 
micro-fermenters were conducted as described in (Ghigo, J.M. (2001) Nature 412: 
442-445.). The biofilms formed on the removable glass slide were photographed and then 
resuspended in 10 ml of M63B1 minimal medium. The optical density at 600 nm (OD 6 oo) 
of the resuspension was then measured. After 24 hours the average resuspended E. coli 
TGI biofilm biomass reached OD 60 o= 5. Each mutant was tested in at least 3 independent 
experiments alongside with the control strain TGI. 

Macroarray analysis 

Genomic expression profiles were performed on E. coli TGI and TG strains grown in 
0.4% glucose M63B1 at 37°C either as planktonic cultures or mature biofilms. Planktonic 
cultures were realized in agitated Erlenmeyer flasks (main experiment) or aerated 
micro-fermenters, both in exponential phase OD 6 oo~0.6 or stationary phase OD 600 ~3. 
Mature biofilms were grown in aerated micro-fermenters (8 and 5 day old biofilms for 
TGI and TG respectively). For all conditions, the equivalent of 15 OD 60 o of bacterial cells 
was collected. The cells were then broken in a Fast Prep apparatus (Bio 101). Total RNA 
was extracted by Trizol (Gibco-BRL) treatment. Genomic DNA was degraded using the 



DNA-/ree™ kit (Ambion). Radioactive* labeled cDNAs, generated by using E. coU Kl 2 
CDS-speoincprimer S (S.GMA.GenoSys),we r eb y b ri d i zed.o £ .co«K12panora m agene 

a^ays containing duplicated spots for eaeb of .be 4,290 predicted E. coU K12 ORFs 
(SI GMA.Ge„oSys). Tbe intensity of eaeh dot was quantified witb ft. XDOTSREADER 
software (Cose) as describee in (Hotnmais, (2001) Mo, Micro** 40: 20-36). 
Experiments were carried ou, using three independent RNA preparations of TGI 
p.ank.onic flask cultures versus TGI biofiun. For tbe F free TG experiment and the TG. 
p.anktonic femten.er versus TG. bioftlm experiments, two independent RNA 
preparations were used. Eacb hybridization with eacb independent sample wascarried on, 
3 widt 1 pg and 10 pg of total RNA. Comparison of me signa, intensity of anays from 
d up.ica,e S or from independent hybridizations showed fha. .he resuUs were higb.y 
reproducible (data not shown). 

Statistical analysis of the Macroarmy data 

15 Genes fha, were statistic* significant* over- and under-expressed were identified 
using the non-paramefric Wi.coxon rank sum fest. For each gene, .he expression in E. coL 
TGI flask exponentia, and ,atio„ary p.ank.onic cultures (n=.0 and n-12, respectively), 
TG flask P .ank.onic cu.tures (n-4), TGI fermen.er p.anktonic cu.ture (n- 4) and TGI 
bioftlm (n-.O) or TG biof.lm (n-4) were compared. Ana,yses were performed with one 
20 ,ai,ed .ests. Genes were considered ,o be statistic^ signif.can.ly over- or 
under-expressed when p<0.05. Low (less man 0.0.) or negauve ,eve,s of expression were 
removed from the analysis. 
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Disruption of genes identified through macroarray analysis 

fimA, msrA, recA, cpxA and pspF mutants were transferred to TGI by PI transduction. 
For the other genes, a non-polar mutation that deletes the entire target gene from the 
5 initiation to the stop codon, was created by allelic exchange with the non-polar aphA gene 
cassette from Tn903. We used a 3-step PCR procedure as described in (Chaveroche, M.K., 
Ghigo, J.M., and d'Enfert, C. (2000) Nucleic Acids Res 28: E97; Derbise, A., (2003) 
FEMS Immunol Med Microbiol 38: 113-116) and detailed at previously. 

The primers used to inactivate the 54 genes presented in this study, as well as nlpE and 
10 cpxR genes, are described in Table 6. 

Quantitative RT-PCR 

Quantitative reverse transcription PCR (Q-RT-PCR) was used to confirm the DNA 
- macroarray data. Total_RNAs_used for macroassay were used for real-time PCR and 
15 RT-PCR. PCR and RT-PCR were performed using a light-cycler (Roche Diagnostics). 
The RNA preparation was subjected twice to DNase I (Roche Diagnostics) treatment for 
30 min at room temperature to remove any contaminating genomic DNA. The enzyme 
was then inactivated 15 min at 65°C in the presence of 2.5 mM EDTA. Samples were 
checked for residual genomic DNA by real-time PCR using the cpxP-RT-5 and 
20 cpxP-RT-3 primers (see Table 7). Reactions were performed in a 20 ul reaction volume 
using LightCycler FastStart DNA master SYBR Green I (Roche Diagnostics) according 
to the manufacturer's instructions. RNA samples were considered to be free of genomic 
DNA if no amplification was detected after at least 35 cycles of amplification. 
Quantitative RT-PCR reactions were performed twice with two independent RNA 
25 preparations and using primers specifics for several biofilm up-regulated genes (see Table 

17 



' 7)orro » W naS rD NApHn, eI s(™ 1 , 5 --ATO A CC A GCCACACrOGAAC.3-(SEQ I D 
NO ,97) an* TM2, 5--CrrCCrCCCCO CT OAAAOTA-3- (SBQ ID N0.9S), wH. 50 

^of^RNA^eacK^ons-p-eQu— onof^Ao^aS^AC. 
5 ^wasaoneusinsRNA^erSVBRG.een^ochelMa^acco^g^c 

r « o^o1p PPR nroduct was confirmed by 
m anufacturer's instructions. Amplification of a angle PGR prod 

fusion curve analysis and electrophoresis on 2% agarose gels. 

Construction ofGFP-tagged strains 

10 ^ sMi n *»* was W — a. *e ,a« * - a **-3 

, , (C— M.K., C Wg o, ,M„ an, C. (.000, **. ^* 

te 2S- B 97 ; D e*.se, A.. (2003, «- Microbiol 3S: ..HW «— 

pZH U. 0fp Sa, - a * - CO. Oue, * — - «— « * 

15 L H3: 33-3S, „ — * *e — . * P— — 

into TGlgfp. 



Flow chamber experiments 
20 BiofltaS we re o uW v a ,e d a t 30.C i n^a me . fl owce 1 U Wi . i n^c h a m « 1 

^-,■.4K«,-**••--"-•^---" ,,,i -•- ,,-, 
a«W CC~n - « Methods 3.0= »4* A — *- 

slip (Kntte, 24 x 50 nun stl; Kntte. «-> « - " — * 



cover si 



" ^ , Allows- 16-20 h old overnight cultures in LB supplemented 

Inocula were prepared as follows. 10 zu n u 



18 



with the appropriate antibiotics were harvested and resuspended in 0.9% NaCl. 250 uL of 
OD 600 - normalized dilutions in 0.9% NaCl (OD 6 oo=0.05) were injected into each flow 
channel after medium flow was arrested. Flow was started 1 h after inoculation at a 
constant rate of 3 mL h" 1 using a Watson Marlow 205S peristaltic pump. 

Microscopy and image analysis 

Biofilm development in micro-fermenters was recorded with a Nikon Coolpix 950 
digital camera. Transmission and scanning laser electronic microscopy were performed 
on biofilm grown in micro-fermenters on thermanox slides (Nalgene) attached to the 
internal removable glass slide and treated as described in (Prigent-Combaret et al, 2000, 
JBacteriol 181: 5993-6002.). 

For flow chamber experiments, microscopic observations and image acquisitions 
were performed on a Zeiss LSM510 Scanning Confocal Laser Microscope (Carl Zeiss, 
Jena, Germany). Images were obtained using a 40x/1.3 Plan-Neofluar oil objective. 
Simulated three-dimensional images were generated by using the JMARIS software 
package (Bitplane AG, Zurich, Switzerland). Images were further processed for display 
using Adobe Photoshop. For COMSTAT analysis (Heydorn et al, 2000, Microbiology 
■ 146 ( Pt 10): 2395-2407) and quantification of the E. coli biofilm development with the 
wild type and the different mutants, each strain was grown in two separate channels, and 
six image stacks were acquired randomly down through each channel at different time 
points (20 h, 45 h, 70 h and 95 h after inoculation). 
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Results 



Production of mature E. coli biofilms 

The capacity of different E. coli K12 strains to form mature biofilms was tested in 
5 M63B1 -glucose minimal medium in a micro-fermenter-based continuous flow culture 
system (Ghigo, 2001, Nature 412: 442-445.)- Most of the strains tested formed only thin 
biofilms after 2 to 5 days. However, high biomass and thick biofilm production (> 200 
uM) was reproducibly achieved using E. coli TGI, a strain carrying the F conjugative 
plasmid previously shown to promote biofilm formation (Ghigo, 2001, Nature 412: 
10 442-445.; Reisner et al, 2003, Mol Microbiol 48: 933-946). To identify E. coli genes that 
are differentially expressed in mature biofilms, we compared 8 day-old TGI biofilms to 
late exponential TGI planktonic (OD= 0.6) or stationary phase cultures (OD= 3). 
Whereas in agitated flask and planktonic culture conditions, no surface adhesion was 
observed, a significant amount of contaminating biofilm formation occurred in 
15 planktonic TGI continuous cultures grown in fermenters. This led us, in the main 
experiment described in this study, to compare planktonic cultures grown in agitated 
flasks to TGI biofilms grown in fermenters. However, differential gene expression 
between planktonic and biofilm bacteria both grown in fermenters was also investigated 
(see discussion). 

20 

Biofilm formation has a global impact on gene expression when compared to 
exponential growth phase 

Total RNAs were isolated from independent biofilm and exponential growth phase 
cultures and subjected to a stringent expression profiling procedure using E. coli 
25 membrane DNA macroarrays. Data were subjected to a Wilcoxon rank test. The 
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expression patiem and predie.ed fi.ne.ion of differentia,!, expressed genes are 
sunned in Fig. 1 and Fig. 7. h, biofitas, 250 genes (5.8 %) were over-expressed (p< 
0 05, 82% of then, with p<0.0O5) whereas .88 genes (4.4%) were under-expressed 
(p<0.05, 85% of them with p<0.005). This indiea.es ,ha. 10.2 % of .he E. co/f genome is 
5 differentia,* expressed in TGI biofihn a. a siatistieafiy signif.can. .eve. (Fig. ., 7 and 
Tah,e 3 and 4). Among .hese identified genes, ..9 % were np or down-regu.a.ed by a 

factor of two-fold or more. 

The mos. signifiean. e.asses of hiofihn-indueed genes when eompared «o .he 
p,ank.o»ie exponentia, grow.fi phase ei,her by .eve, of over-expression or by number are 
,0 i) genes invoived in ee„u,ar proeesses sueh as enve,ope stress-responses ipspABCDE, 
cpxP spy, rpoE, rseA, rse B) and stress (recA, *Q as we,, as ee„ enve,ope biogenesis 
and transport 0M **>. «> -« invoWed in energy icyoD. sucA, s*A. *V> and 
camohydrate metaboUe funetions UtmB) and Hi) genes of unknown fi.ne.ion 

(48 %) (Fig. 1)- 

15 The main Casses of repressed genes ineh.de genes invoWed in amino aetd, 
earhohydrate transport and inorganie ion transport and genes of unknown fimetion (F.g. 7 
mi T ab,e 4). in .he re, of this study, we foeus on genes that were found to be the mos, 
over-expressed in E. coU biofihns. The ro,e and signifieanee of the repressed fi.net.ons 
will be reported elsewhere. 

20 

Bot U senary pnase ana ^specific g enes arc m -« »+- 

Mature biofims eonstitute heterogeneous environment where baeieria grow at 
Afferent rates. This fie.erogenei,y is proposed to be mostiy dependent on nuhien, 
avai,abi,i.y and depm-re.ated eonditions ereated within me biofihn. We wished .0 
25 d e,rmine.owfiatex,en.,hegenesiden.i fi edaboveweretiu,ybiof.,m-speeif.eor,ins.ead, 
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a consequence of the stationary phase-like conditions prevailing in the mature biofilm. 
Total RNAs were isolated from independent stationary phase planktonic cultures, 
subjected to the expression profiling procedure and compared to biofilm profiling 
(complete comparison is published). Among the 64 genes found to be the most induced in 
biofilm versus exponential phase ( >two-fold ratio, see Fig. 1), 61% (39/64) of them were 
not induced in biofilm when compared to stationary phase (Table 1). This suggests that 

these 39 genes are not biofilm-specific, but may, instead, reflect the stationary phase-like 

growth conditions within the mature E. coli biofilm. 

In contrast, 39% (25/64) of the remaining genes were also over-expressed in biofilm 

versus stationary growth phase, 24 of which with a ratio 2>2, thus defining a set of 

biofilm-specific genes (Table 1 and Table 5). 

Validation of the macroarray data 

Several approaches were used to validate the data issued from transcriptional 
profiling experiments. We checked the correlation between expression data and operons 
structure in E coli. An analysis restricted to the genes with known function found to be 
induced by at least a two-fold factor in biofilm compared to exponentially grown cells 
showed that 51 % of them (21/41) were predicted to be included in 14 different operons, 
using the EcoCyc Database. For 10 of these 14 operons, we identified at least two 
members of the operon whose expression was induced in biofilms compared to 
exponentially grown cells. Furthermore, in order to verify the expression level changes, 
we then performed a Quantitative RT-PCR analysis (Q-RT-PCR) on a selection of the 
biofilm growth-regulated genes. Q-RT-PCR was performed for 7 of the most 
biofilm-induced genes compared to exponentially grown cells (cpxP, ycfJ, ycfR, yebE, 
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cyoD, sucA and JimA, see Fig. . and Tab.e 1). Fig. 2 shows a good cone,a«on between a. 

data obtained by the two different techniques (i- 1.12). 

These resuhs indicate bo* a god interna, consistency of our macroarray data as wel. 
. a good corre,a«ion between our ana,ysis and actua, mRNA >eve., as experimemaUy 
determine, by Q-RT-PCR. To ex.ec, further functiona. information front our 
DNA-atray data, we then wished to anaiyze the biofunwelateu phenotypes of isogentc 
mutants of the identified biofllm-induced genes. 



,0 Functional „ro fi Un g - *+- » — — " 

mature biofilm development 

^ong genes signiftcandy induced in TGI biofi.ms (when compared to p.anx,onic 
exponent growm phase ce„s), 64 genes were found to be over-expressed by a, leas, a 
factor of «wo (Tab.e 1). To test direcdy the contribution of these genes ,o biofilm 
15 d eve,opmen,, we de.eted 23 of the 2 5 genes that were over-expresseo in b,ofi lm s 

,ha, were omy induced in biofdms versus exponentia, grow* phase. Mutations in s*A, 

^ *w inm. *o (* »** »" ™ A ,mpaired "° wth in M63BI 

gl „cose minima, medium (data no, shown). Mu,an K in these genes, a,o„g with/rs,, an 
20 essentia, ce„ division gene, cou.d no, be meamngfuUy tested for biofihn formation and 

nations can be suppressed by exogenic muumons (De Las Penas - a,, ,997, J 
Bacteri o, 179: 6S62-6S64). Ahhough our ^ mutan, did no, exhibit ml. wi.d-.ype 
^h we canno, exc.ude me appearance of such suppressor mu,a,ions in mis mutant 
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The ability to form a mature biofilm within 24 hours was assessed for each mutant and 
compared to TGI. Both macroscopic biofilm development in micro-fermenters and 
biofilm cell density after dispersion of the biofilm grown on the removable glass slide of 
the fermenter were examined. Twenty mutants displayed a reduced biofilm phenotype 
5 (see Table 1, Fig. 3 and Fig. 8). Nine of the mutants with reduced biofilm biomass 
correspond to genes of known function: fimA, msrA, rbsB, mdh, IctR, tatE, recA, cpxP and 
spy. 

fimA, msrA, rbsB and mdh are genes encoding proteins that have been already linked 
1 0 to biofilm formation or adhesion properties (see above). As expected, adhesion appeared 
to be a key factor of TGI biofilm formation. Indeed, fimA encodes for the major subunit 
of type I fimbriae, a known initial adhesion factor (Klemm and Christiansen, 1987, Mol 
Gen Genet 208: 439-445) whose role has been previously demonstrated in biofilm 
formation (Austin et al, 1998, F EMS Microbiol Lett 162: 295-301; Cookson etal, 2002, 
15 IntJ Med Microbiol 292 : 1 95-205 ; Cormio et al , 1 996, Scand J Urol Nephrol 30: 1 9-24; 
Pratt and Kolter, 1998, Mol Microbiol 30: 285-293; Watnick et al, 1999, J Bacterial 181: 
3606-3609). In contrast with our results, Reisner et al. recently showed that a fimA 
mutation had no effect on the development of biofilms formed in flow chambers by a F 
plasmid-bearing E. coli strain (Reisner et al, 2003, Mol Microbiol 48: 933-946). 
20 Differences in strain, medium and biofilm growing system used might account for this 
discrepancy. msrA encodes a peptide methionine sulfoxide reductase (MsrA), a repair 
enzyme, that contributes to the maintenance of adhesins in Streptococcus pneumoniae, 
Neisseria gonorrhoeae, E. coli (Wizemann et al, 1996, Proc Natl Acad Sci U S A 93: 
7985-7990) and in Mycoplasma genitalium (Dhandayuthapani et al, 2001, J Bacteriol 
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183: 5645-5650), which could explain the alteration of biofilm formation in the msrA 
mutant. 

The biofilm lifestyle leads to a profound modification of energy metabolism as judged 
by the identification of mdh, rbsB and IctR as biofilm-induced genes. The rbsB and mdh 
5 genes have been already identified as being over-expressed in biofilms formed by 
pathogenic E. coli (Tremoulet et al, 2002, FEMS Microbiol Lett 215: 7-14). rbsB is part 
of the rbsDACBK operon that encodes high affinity transport of and chemotaxis towards 
D-ribose (rbsC and rbsD are also induced in biofilm, see Table 3). mdh encodes malate 
dehydrogenase, an enzyme of the TCA cycle. The IctR gene encodes for a regulator of 
10 L-Lactate dehydrogenase. Furthermore, several sugar metabolism/transport systems are 
activated in biofilm (maltose transport, glycerol metabolism and uptake, galactose 
binding proteins, see Table 3). 

Our results also suggest that mature E. coli biofilm formation might require 
Tat-dependent secretion of a specific set of proteins. Indeed, tatE is proposed to be 
1 5 involved in the twin-arginine cell envelope protein transport system (Chanal et al. , 1 998, 
Mol Microbiol 30: 674-676). In P. aeruginosa, tatA and tatB, encoding components of 
this secretion system, have been shown to be induced in biofilms (Whiteley et al., 2001, 
Nature 413: 860-864), whereas tatC have been shown to be required for biofilm 
formation (Ochsner et al. , 2002, Proc Natl Acad Sci U S A 99: 83 1 2-83 1 7). 
20 We also observed a defect in mature biofilm formation in a recA mutant (Fig. 3). This 

underlines the importance of stress-responses in E. coli TGI biofilm. Consistent with this 
result, several stress-response genes are over-expressed in TGI biofilm (SOS response: 
dinl dinP, dinG, sbmC, recN, sulA; general stress: rpoS; chaperones: dnaJ and dnaK; 
heat-shock proteins: htpX, htpG and ddg; DNA repair: exo, xthA and envelope stress: see 
25 Table 3 and below). cpxP and spy are both linked to envelope stress response (Connolly et 
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ai, 1997, Genes Dev 11: 2012-2021; Danese and Silhavy, 1997, Genes Dev 11: 
1183-1193; Raivio and Silhavy, 2001, Annu Rev Microbiol 55: 591-624) and will be 
investigated below. 

We could also assign a biofilm-related function to 11 genes of previously unknown 
5 function (ycfJ, ycfR, yoaB, yqcC, yggN, yneA, yccA, yfcX, yghO, yceP and ygiB). YfcX 
may be required for fatty acid utilization as a carbon source in anaerobic conditions 
(Campbell et ai, 2003, Mol Microbiol 47: 793-805). Among these 11 genes, 5 encode 
putative extra-cytoplasmic proteins (ycfJ,ycjR, yqcC,yneA,yccA). YcfJ is homologous to 
UmoD of P. mirabilis, a protein that negatively regulates the jlhDC flagellar and 
10 swarming master operon (Dufour et ai, 1998, Mol Microbiol 29: 741-751). yccA is a 
putative c/p*-regulon member (De Wulf et ai, 2002, J Biol Chem 111: 26652-26661) 
encoding a protein of unknown function but it has been shown to be a substrate for the 
membrane protease FtsH (Kihara et al, 1998, J Mol Biol 279: 175-188). Among the 
mutants lacking any one of these five putative extra-cytoplasmic proteins, AycfJ and 
1 5 AyccA were the most affected for mature biofilm formation, with a reduction of about 
50% compared to wild type strain TGI (Fig. 8). 

To investigate the biofilm-related role of these two putative membrane proteins 
further and to confirm their importance in mature biofilm formation, we genetically 
introduced the Green Fluorescent Protein (GFP) gene into the wild type strain TGI, and 
20 in the mutant strains TGlAvc/J and TG\ AyccA. This allowed us to compare biofilm 
formation between TGlgfp and TGlgfpAycfJ and TGlgfpAyccA in continuous flow 
chamber cultures, another well established experimental model that is a non-invasive 
means of observing where the spatial arrangement of the cells is preserved. This 
experimental system allows the quantitative, real-time monitoring of biofilm architecture 
25 development using Confocal Laser Scanning Microscopy and COMSTAT analysis 
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(Heydom et al, 2000, Microbiology 146 ( Pt 10): 2395-2407) (Fig. 4). Initial adhesion of 
the two ycfJ and yccA mutants was not affected, as measured by substrate coverage and 
biomass analysis. However, the maturation of the biofilm formed by these two mutants 
was strongly delayed, especially for the yccA mutant. Indeed, in the yccA mutant, the 
accumulated biomass remained very low over time and typical biofilm mushroom 
structures appeared only sporadically and much later compared to wild type strain TGI 
(see Fig. 4). This suggests a role of YcfJ and YccA proteins in biofilm maturation. 

These results demonstrate the involvement in mature biofilm formation of 30% of 
the most highly expressed genes identified in our study. 50 % of these genes (10/20) were 
induced in biofilm versus both exponential and stationary growth phase (cpxP, spy, tatE, 
IctR, mdh, rbsB, ygiB, yqcC, yceP and yfcX) whereas the other 50 % (10/20) were only 
induced in biofilm versus exponential growth phase (fimA, msrA, recA, yoaB, ycfJ, ycfR, 
yneA, yccA, yggN and yghO) (see Table 1 and Table 5). 

Biofilm-induced genes are not involved in the early stage of biofilm formation 

A failure to form a wild type mature biofilm could result from an initial adhesion 
defect. Therefore, we investigated whether the genes identified as over-expressed in 
mature TGI biofilms and that impaired mature biofilm formation when mutated were also 
involved in the early adhesion steps. For this, we tested this mutants in a static microtiter 
plate-based assay that has been widely used to study the first steps of biofilm formation 
(Genevaux et al., 1996, FEMS Microbiol Lett 142: 27-30; OToole et al, 1999, Methods 
Enzymol 310: 91-109). With the exception of fimA, the early adhesion capacity of the 
mutants could not be distinguished from the parental strain (Fig. 9). This result indicates 
that most genes over-expressed in mature biofilms are not involved in the early steps of 
this process and confirms that they participate in mature biofilm functions. 

27 



Comparison of E. coli F ¥ /F r biofilm global response: general relevance to E. coli 
biofilm 

In this study, we used an E. coli strain carrying a conjugative plasmid, a widespread 
5 situation which promotes biofilm formation (Ghigo, 2001 , Nature 412: 442-445; Reisner 
et al. 9 2003., Mol Microbiol 48: 933-946). To distinguish general features of E. coli 
biofilms from those specific to our model, we analyzed the transcription profile of the E. 
coli strain TG, an F-free isogenic derivative of TGI. This control is of particular 
relevance because some of the genes found to be the most over-expressed (pspA, cpxP) 
10 have either been shown to be related to the conjugation process {cpx stands for 
conjugation plasmid expression (McEwen and Silverman, 1980, Proc Natl Acad Sci U S 
A 11: 513-517) or to stress-responses that could correlate with the expression of 
membrane appendages such as conjugative pili. TG forms a thin and fragile biofilm after 
5 days of culture in micro-fermenters (data not shown). Total RNA was isolated from E. 
1 5 coli TG biofilm and flask planktonic exponential cultures, and was subjected to the same 
macroarray analysis as described for TGI. TGI and TG biofilms were not strictly 
comparable in terms of depth and structure (and therefore, possibly, for biofilm-induced 
responses). As expected, some functions induced in TGI, for instance RecA and part of 
the SOS stress pathway, were not induced in TG (Table 1), suggesting that F-specific, 
20 possibly transfer-related, responses are induced in TGI biofilm. Despite this fact, 33% of 
the genes induced in TGI biofilm by an over two-fold factor were also found to be 
statistically significantly over-expressed in TG biofilm (including cpxP, rseA, rseB, spy, 
psp operon members, tatE, and fimA, see Table 1). This demonstrates that many of the 
biofilm-induced genes identified in this study are F-independent and part of a general E. 
25 coli Kl 2 biofilm response. 
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Envelope stress pathways in E. coli mature biofilm 

cpxP is one the most over-expressed genes in E. coli TGI biofilms versus planktonic 
growth phase (Fig. 1, Table 1 and Table 5). cpxP is a target of the cpx two-component 
5 system, which is known to respond to a variety of extra-cytoplasmic stress (envelope 
stress) (Raivio and Silhavy, 2001, Annu Rev Microbiol 55: 591-624). 

We therefore investigated the effect of deletion mutations in key components of the 
cpx pathway on biofilm formation. As shown in Figure 5, inactivation of the 
sensor-regulator components of the cpx system (cpxA, cpxR), but also of cpxP and of nlpE 
10 affected biofilm formation in micro-fermenters. A mutation in spy (a biofilm-induced 
cpxP homolog) has no effect on biofilm biomass. rpoE and rseA mutants displayed a 
growth rate defect and consequently could not be studied in micro-fermenters. A 
mutation in rseB, the second anti-sigma E factor of the RpoE envelope stress pathway, 
did not affect growth and a rseB mutant formed a wild type biofilm. Whereas it is difficult 
15 to conclude that the rpoE pathway has a role in biofilm formation, the cpx pathway 
appears to contribute to biofilm development, based on the morphological effects caused 
by mutations in several of its key components. Indeed, the biofilms produced by both 
TGlAcpxR and TGI AcpxP in micro-fermenters were very fragile compared to wild type 
TGI biofilms. TGlAcpxP biofilm was made of large plaques, in strong contrast to the 
20 homogeneous TGI biofilm (Fig. 6 ABC). Consistent with this observation, a detailed 
electron microscopy analysis revealed that a cpxP mutation strongly altered biofilm 
macromorphology (Fig. 6DE). Despite its fragility, no clear structural defect could be 
detected in the TGlAcpxR biofilm (data not shown). Even though slight structural 
differences could also be seen in the TGlAspv mutant biofilms, structural alterations were 
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not found in nlpE, cpxA nor rseB mutant biofilms grown in micro-fermenters (data not 
shown). 

To further investigate the role of cpxP and cpxR, we introduced a gfp allele into 
TGlAcpxP and TGlAcpxR and we compared their biofilm formation to the parental 

5 TGlgfp strain in continuous flow chamber cultures. Single cells and very small colonies 
were observed on the surface for these two mutants during the initial steps of biofilm 
development in contrast to the wild-type that forms normal three-dimensional colonies 
(Fig. 4, 20 and 45 h). Furthermore, both cpxP and cpxR mutants were also strongly 
affected for maturation of the biofilm (Fig. 4). These experiments suggest that stress 

10 envelope pathways are involved in the establishment of a structured mature biofilm in E. 
coli. 

Phage-shock protein operon [psp) is expressed in response to a variety of 
environmental and intracellular stresses including processes related to protein insertion in 
the outer membrane (Weiner and Model, 1994, ProcNatlA cad Sci USA 91: 2191-2195). 

15 While the precise functions of the psp genes are not understood, they help to ensure 
survival of E. coli in adverse conditions, suggesting that psp genes are part of a 
stress-response operon (Model et al, 1997, Mol Microbiol 24: 255-261). In our analysis, 
pspA and other members of the operon (pspBCDE) were consistently over-expressed in 
biofilm (Fig. 1, Table 3 and Table 5). Nevertheless, the disruption of the pspABCDE 

20 operon did not have a major impact on early (Fig. 9) or late biofilm formation nor on 
biofilm structure (data not shown). 
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Discussion 



ta th U study we invest the differences in gene expression between * *>« K.2 
mature biofilm and planktonic laboratory cultures. Using DNA macroarrays we showed 
that to b.ofilm li^le, wbile sharing — ies witb the stationary .rowth phase, 



triggers 



the expression of specific sets of genes. 
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of the E. coli genes are differentially expressed by more than a factor of two. However, 
below that threshold, biofilm formation still leads to the statistically significant 
differential expression of more than 10% of the E. coli genome. These results therefore 
support the proposal that biofilm formation results in and from significant differences in 
5 the overall make-up of bacterial cells (Sauer, 2003, Genome Biol 4: 219; Stoodley et al, 
2002, Annu Rev Microbiol 56: 187-209). 

Mature biofilm cells have been proposed to have stationary growth phase traits such 
as reduced growth and metabolic activity. To investigate the stationary phase character of 
bacterial life within biofilm, we also compared the expression pattern of stationary phase 
10 cultures with those determined for the exponential growth phase and the mature biofilm. 
Biofilm-specific genes, i.e. genes differentially regulated in biofilm versus both forms of 
planktonic phases, correspond to 4% of the genome (118 over- and 53 under-expressed/ 
4290) and this proportion decreases to less than 1% (0.67%, 23 over and 6 under/4290) 
for genes varying by a factor of more than two. When one only considers the genes 
1 5 induced in response to the stationary growth phase character of the biofilm lifestyle, these 
genes represent 3% of the genome. The biofilm lifestyle, while sharing similarities with 
the stationary growth phase, thus triggers the expression of specific sets of genes. 

Functional profiling of the biofilm-induced genes 

20 The biological importance of the differential gene expression exhibited upon biofilm 

versus planktonic growth was tested by the disruption of the majority of the 
highly-induced genes in biofilms, including all biofilm-specific induced genes. We show 
that, while the mutants were not impaired in initial steps of adhesion to surfaces (with the 
exception of fimA), a third of them (20 genes) were affected in the biofilm maturation 

25 (Table 1, Fig. 3 and Fig. 8). This high proportion of genes involved in the biofilm 
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maturation strongly supports the pertinence of our analysis. Among these 20 genes, half 
spond to biofilm-specific genes whereas the other half was only induced in biofilms 
xponential growth phase (see Table 1 and Table 5). This indicates that the 
development of a full mature biofilm requires not only biofilm-specific genes but also 
genes related to the stationary phase character of the biofilm. The individual role of some 
of these newly identified genes is currently being investigated. 



Biofilm-related physiological functions 

We show that genes found to be the most over-expressed in TGI biofilm versus 
exponential growth phase were also part of the E. coli F-free biofilm response, therefore 
indicating that genes identified in this study are involved in the general response 
developed in mature E. coli K12 biofilms. Those genes are not distributed randomly into 
all potential functional classes. Instead they display a strong bias toward specific 
-functional-categories and .we. propose that they are part of the biofilm genetic signature. 
15 Genes whose expression is required for full maturation of TGI biofilm belong to 
functions linked to adhesion (fimA, msrA), energy metabolism (rbsB, mdh, IctR), 
transport (tatE), general stress {recA), and envelope stress response (cpxP and spy). 
However, it is likely that many genes identified in our study are not specifically involved 
in biofilm-specific functions but rather correspond to adaptive responses to the biofilm 
20 environment. Mutations in many biofilm-induced genes that also correspond to 
information storage and processing, metabolism, cellular processes and unknown 
functions have indeed no effect on TGI biofilm formation (Table 1). 

Moreover, 48% of the genes significantly over-expressed in biofilms versus 
exponential growth phase were of uncharacterized function. Compared to 19.6% of such 
25 genes found in the E. coli genome (Serres et al, 2001 , Genome Biol 2: RESE ARCH003 5), 
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this high proportion of genes of unknown functions expressed in mature biofilm suggests 
that new aspects of E. coli biology are adopted during biofilm formation. We show that 1 1 
of these uncharacterized genes are necessary for full mature biofilm formation, thus 
experimentally assigning them a biofilm-related function (Table 1, Fig. 3 and Fig. 8). 
5 Among them 5 encode putative membrane proteins that could be of particular relevance 
when considering the importance of envelope-related physiology within a biofilm. 

Consistent with the drastic phenotypic changes occurring inside biofilms, we found 
that 15 % of the genes identified as over- or under-expressed in biofilms versus 
exponential growth phase are involved in either energy processes or carbohydrate 
10 metabolism (Fig. 1 , Table 1 and 3). Despite the presence of polysaccharides in the TGI 
biofilm (data not shown), we could not clearly associate the expression of any of those 
genes with the production of the biofilm matrix {i.e., cellulose, colanic acid). This could 
reflect, among other explanations, a lack of sensitivity of our approach due to the 
averaging occurring while extracting transcription information from the heterogeneous 

1 5 bacterial biofilm population. 

A partial comparison of the most over-expressed genes in our analysis (>2 fold 
factor) and in the study by Schembri et al. (> 8 fold factor) only revealed a few genes 
identified as over-expressed in E. coli biofilm in both studies (rbsB, b0836, yfjO, yceP, 
glgS, ydeW, yneA, yqeC, ylcC, rplV, rplD, rpsS, M550, rplP, rpsR.flu, rplM, ppc, oppA, 
20 gatD, cydA, atpB, rpsN, malK, atpG) (Schembri et al, 2003, Mol Microbiol 48: 253-267). 
Three of these genes (rbsB, yceP, yneA) were nevertheless also found here to be required 
for mature biofilm formation. This relatively low overlap between the two studies may be 
due to technical differences. Different scenarios were used in terms of strain background, 
media and experimental set-up. This could also reflect the difference in the gene 
25 expression pattern between two biofilms at very different stages of maturation (i.e. young 
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(Whiteley et al, 2001, Nature 413: 860-864). As stresses are known to induce prophages 
and other mobile genetic elements, our results suggest that Pseudomonas prophage 
induction may be a consequence of stresses created by the drastic conditions that prevail 
inside the biofilm. As such, stress may well be a key factor in the mechanisms that lead to 
5 the observed antibiotic resistance inside biofilm communities. 

Owing to the possible role of cell-cell and cell-surface interactions in biofilm, it may 
be of significance that envelope stress genes such as cpxP, spy and the psp genes are 
consistently induced in this environment. CpxP may inhibit the cpx-mediated induction 
through a direct interaction with the two-component system sensor CpxA, while Spy may 
10 play a similar role on the rpoE pathway (Raivio et al, 2000,Mo/ Microbiol 37: 
1186-1197). The cpx system is known to respond to envelope stresses such as 
over-production and misfolding of membrane proteins or elevated pH (Raivio and 
Silhavy, 2001, Annu Rev Microbiol 55: 591-624). However, relatively little is known 
about the physiological role of envelope stress-responses. Recently, adhesion of E. coli 
1 5 cells to hydrophobic but not hydrophilic surfaces was shown to activate the cpx system, 
including cpxP, through a process called surface sensing which requires both cpxR and 
nlpE(Ot\o and Silhavy, 2002, Proc Natl Acad Sci USA 99: 2287-2292). Consistently, we 
find that cpxP and spy are highly induced in mature biofilms where bacteria are de facto 
in contact with the hydrophobic surfaces of other cells. 
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Our results thus provide additional experimental evidence that stress response 
pathways are key factors in biofilm formation. The structure of biofilms grown in 
micro-fermenters is altered in a cpxP mutant (Fig. 6) and to a lesser extent in a spy mutant. 
Observation of spy mutant biofilms by transmission electron microscopy also revealed a 
25 high proportion of spheroblasts as compared to wt TGI (data not shown), suggesting a 



36 



^ a cp** nru<a„, are ho* impaired in forming ^ type micro-coionies (Fig. 4). This 
strongly corrooora.es ,he idea .ha, c P ,P and cp,* — have reduced ceiHo-cen 

5 fOT ces of .he flow. 1, appears, .en, rha. .he inappropriaie expressron of .he „ resumed 

• „f the rnx reeulon in the c/wP mutant or an absence of 
genes in biofilm, i.e. a derepression of the c^regulon 

biofllm fomra,ion. Considering .he impose of en— colons in hioflhu 
• ™ „„ m0 o„e„,sys,ems,whiehse„ S epe rt nr b a,ionsorehangesin,hebac,ena 1 
formation, two component sysieina, 

, tnrv role in bacterial biofilm formation, a proposal that 
10 environment, might play a regulatory role in 

requires further investigation. 

Ouran^Uiden.if.ed.hehiofi.^odeofgrow.haaan— „, ,ha, .nducea 

fc evasion of *e ~ opero. However no *-H— 

ps „ m igh, aiso regu,a,e a hio—ed iocua .ha, is no, parr of ^ opero, 

;lee for such an addr.ona, ^ regnia.ed ,arge, has heen ^ ^ 
,r flD5D re g ulon(DaminandMiller,2001,Mo/M I cro^39.429 444). 
Yersinia enterohtica psp reguion ^« 

20 

Chan.es in g ene expression and biofilm development 

expression demonstrated here and in other studies could be 
The changes in gene expression oen 

ca uaed h y ,he eondhiona progress erea,ed whhin ,he hioflhn during ,,a — 



37 



is a developmental process in which genetic checkpoints could control the maturation of 
the biofilm by inducing a succession of biofilm-specific genes. Whereas 8 mutations out 
of 54 mutants created in this study display a 50% decrease in biofilm biomass and 
maturation, none of them lead to a total loss of biofilm formation. Considering the 
existence of multiple and partially overlapping or complementing pathways that can lead 
to biofilm formation, this result, without formally excluding the existence of a biofilm 
developmental program, rather speaks in favor of the second working hypothesis. In this 
case, most changes observed in biofilm gene induction could be a consequence of, rather 
than a prerequisite for the biofilm maturation. 
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The results presented here provide new insights into the global effect triggered by 
biofilm formation in E. coli. By monitoring the changes in gene expression occurring in 
mature biofilms, we have identified biofilm-related physiological pathways and 
previously-uncharacterized biofilm-induced genes. This may lead to new biofilm control 
15 strategies that will likely hinge upon a better understanding of biofilm-induced 
physiological responses. 

Discussion 
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In this study we investigated the differences in gene expression between E. coli K12 
mature biofilm and planktonic laboratory cultures. Using DNA macroarrays we showed 
that the biofilm lifestyle, while sharing similarities with the stationary growth phase, 
triggers the expression of specific sets of genes. 



25 



Modifications ofE. coli K12 gene expression induce by the biofilm lifestyle 
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The use of large scale fusion technology had already suggested that a significant 
fraction of the bacterial genome could be involved in biofilm physiology 
(Prigent-Combaret et al, 1999, J Bacteriol 181: 5993-6002). Accordingly, P. putida and 
P. aeruginosa biofilm proteome analyses showed that a large number of genes are 
5 differentially regulated during biofilm development (Sauer and Camper, 2001, J 
Bacteriol 183: 6579-6589; Sauer et al, 2002, J Bacteriol 184: 1 140-1 154). In contrast, a 
transciption profiling of the P. aeruginosa planktonic and biofilm phases led to the 
conclusion that only 1% off. aeruginosa genes display over a two-fold difference in 
gene expression (Whiteley et al, 2001, Nature 413: 860-864). 
10 In E. coli, Shembri et al recently showed that approximately 5 to 10 % of the E. coli 

genes exhibited altered microarray expression profiles when compared planktonic growth 
phases and young biofilm cultures. They hypothesized that this could be due to the rather 
early stages of biofilm development analyzed in their study, where the still ongoing 
switch from planktonic to sessile growth could result in a high level of transient gene 
15 expression (Schembri et al, 2003, Mol Microbiol 48: 253-267). 

Here, we compared mature biofilms to the planktonic exponential growth phase and 
showed that, as in the case of mature P. aeruginosa biofilms, only a small fraction (1 .9 %) 
of the E. coli genes are differentially expressed by more than a factor of two. However, 
below that threshold, biofilm formation still leads to the statistically significant 
20 differential expression of more than 1 0% of the E. coli genome. These results therefore 
support the proposal that biofilm formation results in and from significant differences in 
the overall make-up of bacterial cells (Sauer, 2003, Genome Biol 4: 219; Stoodley et al., 
2002, Annu Rev Microbiol 56: 187-209). 
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Mature biofilm cells have been proposed to have stationary growth phase traits such 
as reduced growth and metabolic activity. To investigate the stationary phase character of 
bacterial life within biofilm, we also compared the expression pattern of stationary phase 
cultures with those determined for the exponential growth phase and the mature biofilm. 

5 Biofilm-specific genes, i.e. genes differentially regulated in biofilm versus both forms of 
planktonic phases, correspond to 4% of the genome (118 over- and 53 under-expressed/ 
4290) and this proportion decreases to less than 1% (0.67%, 23 over and 6 under/4290) 
for genes varying by a factor of more than two. When one only considers the genes 
induced in response to the stationary growth phase character of the biofilm lifestyle, these 

10 genes represent 3% of the genome. The biofilm lifestyle, while sharing similarities with 
the stationary growth phase, thus triggers the expression of specific sets of genes. 

Functional profiling of the biofilm-induced genes 

The biological_importance of the differential gene expression exhibited upon biofilm 

15 versus planktonic growth was tested by the disruption of the majority of the 
highly-induced genes in biofilms, including all biofilm-specific induced genes. We show 
that, while the mutants were not impaired in initial steps of adhesion to surfaces (with the 
exception of fimA\ a third of them (20 genes) were affected in the biofilm maturation 
(Table 1, Fig. 3 and Fig. 8). This high proportion of genes involved in the biofilm 

20 maturation strongly supports the pertinence of our analysis. Among these 20 genes, half 
correspond to biofilm-specific genes whereas the other half was only induced in biofilms 
versus exponential growth phase (see Table 1 and Table 5). This indicates that the 
development of a full mature biofilm requires not only biofilm-specific genes but also 
genes related to the stationary phase character of the biofilm. The individual role of some 

25 of these newly identified genes is currently being investigated. 

40 
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Consistent with the drastic phenotypic changes occurring inside biofilms, we found 
that 15 % of the genes identified as over- or under-expressed in biofilms versus 
exponential growth phase are involved in either energy processes or carbohydrate 
metabolism (Fig. 1, Table 1 and 3). Despite the presence of polysaccharides in the TGI 
5 biofilm (data not shown), we could not clearly associate the expression of any of those 
genes with the production of the biofilm matrix (i.e., cellulose, colanic acid). This could 
reflect, among other explanations, a lack of sensitivity of our approach due to the 
averaging occurring while extracting transcription information from the heterogeneous 
bacterial biofilm population. 
10 A partial comparison of the most over-expressed genes in our analysis (>2 fold 

factor) and in the study by Schembri et al. (> 8 fold factor) only revealed a few genes 
identified as over-expressed in E. coli biofilm in both studies (rbsB, b0836, yfiO, yceP, 
glgS, ydeW, yneA, yqeC, ylcC, rplV, rplD, rpsS, M550, rplP, rpsR.flu, rplM, ppc, oppA, 
gatD, cydA, atpB, rpsN, malK, atpG) (Schembri et al, 2003, Mol Microbiol 48: 253-267). 
1 5 Three of these genes (rbsB, yceP, yneA) were nevertheless also found here to be required 
for mature biofilm formation. This relatively low overlap between the two studies may be 
due to technical differences. Different scenarios were used in terms of strain background, 
media and experimental set-up. This could also reflect the difference in the gene 
expression pattern between two biofilms at very different stages of maturation (i.e. young 
20 and thin biofilms in Schembri et al. versus mature and thick biofilms in our study). 
Further studies comparing the expression profile of E. coli biofilms at different 
maturation stages within the same experimental set-up will provide a more dynamic view 
of biofilm gene expression. 

25 Heterogeneity of oxygen conditions in E. coli K12 biofilms 
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Biofilms are heterogeneous environments and, with respect to aerobiosis, our analysis 
supports these results. In the main experiment described in this study, we compared 
exponentially grown agitated flask cultures to TGI biofilm in aerated conditions. Under 
these conditions, numerous genes known to be induced by aerobiosis were also induced in 
5 biofilms, including some genes for TCA cycle enzymes (e.g. aceB, cyo operon members, 
fadB, mdh, glpD, sucAB). In addition, some genes known to be repressed by aerobiosis 
were repressed in biofilms (eg. adhE, cydAB, dcuC,focA,fumB). This tends to indicate 
that our biofilms were mainly grown under aerobic conditions. Consequently, we also 
compared differential gene expression between TGI biofilms and TGI planktonic 
10 cultures, both grown in aerated fermenters (data not shown). In this configuration, we 
clearly observed that some typical aerobic genes were induced in biofilms whereas others 
were repressed. This was also the case for typical anaerobic genes. This could reflect the 
heterogeneity of the aerobic conditions in biofilms, in which external bacteria are in 
eontact-with oxygen while internal bacteria are in conditions close to anaerobiosis. 
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Stress-responses in biofilms 

Our study revealed that a major physiological response to biofilm formation is the 
induction of stress-responses. Interestingly, such a stress-response induction may also 
take place in P. aeruginosa biofilms. Indeed, the most highly activated genes identified in 

20 a P. aeruginosa biofilm transcriptome analysis were those of temperate bacteriophages 
(Whiteley et al, 2001, Nature 413: 860-864). As stresses are known to induce prophages 
and other mobile genetic elements, our results suggest that Pseudomonas prophage 
induction may be a consequence of stresses created by the drastic conditions that prevail 
inside the biofilm. As such, stress may well be a key factor in the mechanisms that lead to 

25 the observed antibiotic resistance inside biofilm communities. 
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Owing to the possible role of cell-cell and cell-surface interactions in biofilm, it may 
be of significance that envelope stress genes such as cpxP, spy and the psp genes are 
consistently induced in this environment. CpxP may inhibit the cpx-mediated induction 
through a direct interaction with the two-component system sensor CpxA, while Spy may 
5 play a similar role on the rpoE pathway (Raivio et al, 2000, Mol Microbiol 37: 
1186-1197). The cpx system is known to respond to envelope stresses such as 
over-production and misfolding of membrane proteins or elevated pH (Raivio and 
Silhavy, 2001, Annu Rev Microbiol 55: 591-624). However, relatively little is known 
about the physiological role of envelope stress-responses. Recently, adhesion of E. coli 
10 cells to hydrophobic but not hydrophilic surfaces was shown to activate the cpx system, 
including cpxP, through a process called surface sensing which requires both cpxR and 
nlpE (Otto and Silhavy, 2002, Proc Natl Acad Sci USA99: 2287-2292). Consistently, we 
find that cpxP and spy are highly induced in mature biofilms where bacteria are de facto 
_in contact with the hydrophobic surfaces of other cells. 
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Our results thus provide additional experimental evidence that stress response 
pathways are key factors in biofilm formation. The structure of biofilms grown in 
micro-fermenters is altered in a cpxP mutant (Fig. 6) and to a lesser extent in a spy mutant. 
Observation of spy mutant biofilms by transmission electron microscopy also revealed a 
20 high proportion of spheroblasts as compared to wt TGI (data not shown), suggesting a 
possible cause for the affected structure of the biofilm in this mutant. In addition, a cpxP 
and a cpxR mutant are both impaired in forming wild type micro-colonies (Fig. 4). This 
strongly corroborates the idea that cpxP and cpxR mutants have reduced cell-to-cell 
adherence, since any growth up in the water column will be counteracted by the shearing 
25 forces of the flow. It appears, then, that the inappropriate expression of the cpx regulated 
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to biofilm formation, this result, without formally excluding the existence of a biofilm 
developmental program, rather speaks in favor of the second working hypothesis. In this 
case, most changes observed in biofilm gene induction could be a consequence of, rather 
than a prerequisite for the biofilm maturation. 

The results presented here provide new insights into the global effect triggered by 
biofilm formation in E. coli. By monitoring the changes in gene expression occurring in 
mature biofilms, we have identified biofilm-related physiological pathways and 
previously uncharacterized biofilm-induced genes. This may lead to new biofilm control 
strategies that will likely hinge upon a better understanding of biofilm-induced 
physiological responses. 
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Genes 

a 



Ratio 

c 



Rank Phenotype TG 

d e f 



Function - description 

S 



INFORMATION STORAGE AND PROCESSING 
J '.Translation, ribosomal structure and metabolism 



rplY* b2185 2.23 52 



rne bl084 2.06 
K : Transcription 



58 



IctR 



b3604 4.76 14 



L : DNA replication, recombination and repair 



recA* 
dinf 



b2699 
bl061 



2.30 
2.02 



METABOLISM 

C : Energy production and conversion 

cyoD* 

b0429 7.41 3 wt 

b0726 6.54 7 NA 

b4079 3.85 23 wt 

b0430 3.59 26 wt 



ND 5 OS ribosomal subunit protein L25 

RNase E, mRNA turnover, maturation 5S 
ND RNA 

Regulator of L-Lactate dehydrogenase 
genes 

DNA strand exchange and renaturation. 
51 - SOS 

61 wt Inhibits RecA-mediated self-cleavage. SOS 



sucA* 
fdhF* 
cyoC* 



nifU* 

sixA 

sucD* 

g , P Q* 



b2529 
b2340 
b0729 

b2239 
b3236 



3.41 
2.74 
2.69 

2.50 
2.19 



27 
41 
43 

46 
53 



NA 
wt 
NA 

ND 



mdh 

G : Carbohydrate transport and metabolism 



lamB b4036 2.94 36 



wt 



rbsB n b3751 2.41 48 
sfsA* b0146 1.97 64 ND 
E : Amino acid transport and metabolism 

gad/ b3517 3.15 30 wt 
nifS* b2530 1.98 62 NA 
/ ; Lipid metabolism 



20 



wt 



Cytochrome o oxidase subunit IV 

2-oxoglutarate dehydrogenase 

S subunit of formate dehydrogenase H 

Cytochrome o oxidase subunit III 

Formation of [fe-s] clusters in iron-sulfur 

proteins 

<sf Phosphatase affecting ArcB phosphorelay 
Succinyl-CoA synthetase, alpha subunit 
Glycerol-3 -phosphate diesterase, 
periplasmic 
Malate dehydrogenase 

Phage lambda receptor, maltose receptor 
D-ribose periplasmic binding protein, 
chemotaxis 

Regulatory protein for maltose metabolism 

yf Glutamate decarboxylase isozyme 
Cysteine desulfurase 

Fatty acid oxidation complex; 4-enzyme 
protein 



fadB* b3846 4.18 

Q : Secondary metabolites biosynthesis, transport and metabolism 

Oxido reductase, dehydrogenase/reductase 
ucpA* b2426 2.32 50 ND family . 



CELLULAR PROCESSES 

D : Cell division and chromosomal partitionning 



ftsL* b0083 4.34 17 



NA 



Cell division and growth, septum 
localization 



sulA* b0958 3.07 33 wt 47 Inhibits cell division. SOS 

O : PosMranslational modification, protein turnover, chaperones 

Ecotin, a periplasmic serine protease 
b2209 6.17 9 wt inhibitor 



eco* 



Table 1. Over-expressed genes (2>2) in E. coli TGI and TG biofilms versus 
exponential growth phase 



a : Gene names according to E. coli Colibri database. 
5 b : Gene names according to Blattner nomenclature. 

c : Ratio of gene expression in E. coli biofilm versus gene expression in planktonic 
cultures. 

d : Rank position ;1= the most over-expressed gene in E. coli biofilm. 
e : Biofilm phenotype of the mutants : ND : not determined ; NA : not applicable due to 
10 growth defect in M63B1 glucose medium ; wt : similar to wild type ; - : biofilm reduced 
compared to wt ; Struct : biofilm structure impaired compared to wt. 
f : 4 genes also found to be significantly over-expressed in F minus E. coli strain TG. 
g : Function description according to E. coli Colibri database. 

h : pspF was expressed by only a 1 .22 factor in TGI biofilm but has been included for 
1 5 comparison with other members of the psp operon. 
Arrow: mutants affected for biofilm formation. 

* : genes that were not induced in TGI biofilm versus stationary phase. 
#: genes that were also induced in TGI biofilm versus stationary phase by at least a factor 
of two. These genes are also summarized in Table 3S. 
20 The genes have been classified according to the COGs functional categories annotation 

system used by the NCBI. 
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Table 2. Strains and plasmids used in this study 

* Additional individual mutants in the following genes: cutC, cyoC 9 dinl, ecoJadBJdhF, 
gadA, IctR, malM-G, mdh, nifS, nifU, nlpE, pspA-E, rbsB, rpoE, rseB, sixA, sodC, spy, 
sucA, sulA, tatE, ybeD, ybjF, yccA, yceP, ycfJ, ycjL, ycJR, ydcl, yebE, yfcX, yggN, yghO, 
ygiB, yhhY, yiaH, yjbO, yneA, yoaB, yqcC, yqeC, were named TGI U3gene.name]::aphA, 
(Km R ). 



Strain / plasmid 



Relevant characteristics 



Reference / source 
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E. coli strains 
PAP6181 
PHL904 
RG075 
STC27 
TGI 

TG 

TGI AcpxA 
TGI AcpxP 
TGlAcpxR 
TGI AfimA 
TGlAmsrA 
TGI ApspF 
TGlrecA 
TGlArseA 
TGlgfp 
TGlgfp AcpxP 
TGlgfp AcpxR 
TGlgfp by cc A 
TGlgfpAycfJ 
* 

Plasmids 

pKOBEG 

pCP20 

* Additional individual mutants in the following genes: cutC, cyoC, dinl ecoJadBJdhF 
gadA, IctR, malM-G, mdh, nifS, ntfU, nlpE, pspA-E, rbsB, rpoE, rseB sixA sodC, spy, sucA, 
sulA, tatE, ybeD, ybjF, yccA, yceP, ycfJ, ycfL, ycJR, ydcl yebE yfcX, yggNyghO ygiB 
yhhY yiaH, yjbO, yneA, yoaB, yqcC, yqeC, were named TG 1 ADgene. name] : :aphA, (Km ). 



K\5\9pspF::miniTnl0 (Tet R ) 
cpxA w neat (Cm R ) 
MGl655AmsrA::ClSpec (Spec R ) 
fimAlwcat (Cm R ) 

V[traD36 proAB+ lacP lacZAMIS] supE 
hsdAS thi A(lac-proAB) 
A F minus derivative of TGI 
TGlcpxA:: Cleat (Cm R ) 
TGlAcpxP::Afrt 
TGlAcpx/?::Afrt 
TGlAfimA::cat (Cm R ) 
TGI AmsrA::QSpec, (Spec R ) 
TGlpspFwminiTnlO (Tet R ) 
TGlrecA 5 6 SrlC300::Tnl0 (Tet R ) 
TGI Arse AwAfrt 
TG 1 Xatt: :gfp-bla, ( Amp R ) 
TGlAcpxP \M::gfp-bla (Amp R ) 
TGlAcpxR \aU::gfp-bla (Amp R ) 
TGlAyccA Xatt::gfp-bla (Km R , Amp R ) 
TGlAycfJ XM::gfp-bla (Km R , Amp R ) 



(Jovanovic et aL, 1996) 
(Dorel et aL, 1999) 
A gift of F. Ban-as 
(Pratt and Kolter, 1998) 
Laboratory collection 

Laboratory collection 

This work 

This work 

This work 

This work 

This work 

This work 

Laboratory collection 

This work 

A gift of A. Roux 

This work 

This work 

This work 

This work 



pSClOl ts (replicates at 30°C), araC (Chaveroche et al, 

arabinose-inducible X redyfia. operon, (Cm ) 2000) 

ts (replicates at 30°C) plasmid bearing the/7p (Cherepanov and 
recombinase gene. (Cm R and Amp R ) Wackernagel, 1995) 



15 



20 



25 



49 



Table 3. Genes over-expressed in E. coliTGl biofilm versus exponential growth 
phase. 

5 The genes found to be over-expressed at a significant level (P-value <3).05) are indicated. 
They have been classified according to the COGs functional categories annotation 
system. 

a : Gene names according to E. coli Colibri database, 
b : Gene names according to Blattner nomenclature. 
10 c : Ranking position 1= the most over-expressed gene in E. coli biofilm. 

d : Ratio of gene expression in E. coli biofilm versus gene expression in planktonic 
cultures. 

e : Function description according to E. coli Colibri database. 

Arrow: mutants affected for biofilm formation. 

15 m . - 

Function - description 

Genes Rank Bio/Exp 

a b c d e m 



INFORMATION STORAGE AND PROCESSING 
/ ; Translation, ribosomal structure and metabolism 



lysU 
miaA 


b4129 


146 


1.45 


Lysine tRNA synthetase 


b4171 


79 


1.84 


Delta(2)-isopentenylpyrophosphate tRNA-adenosine transferase 


rluC 


bl086 


166 


1.35 


Ribosomal large subunit pseudouridine synthase C 


rne 


bl084 


58 


2.06 


RNase E 


rplY 


b2185 


52 


2.23 


50S ribosomal subunit protein L25 


K: Transcription 






Transcriptional regulator of genes for curli 


crl 


b0240 


81 


1.83 


cspD 


b088O 


135 


1.50 


Cold shock protein 


dniR 


b0211 


199 


1.24 


Transcriptional regulator for nitrite reductase 


fruR 


b0080 


133 


1.50 


Transcriptional repressor of fru operon and others 


idnR 


b4264 


227 


1.18 


L-idonate transcriptional regulator 


lad 


b0345 


170 


1.34 


Transcriptional repressor of the lac operon 


^IctR 


b3604 


14 


4.76 


Regulatory protein for L-Lactate dehydrogenase genes 


nac 


bl988 


105 


1.66 


Nitrogen assimilation control protein 


rnk 


b0610 


126 


1.53 


Regulator of nucleoside diphosphate kinase 


rpoS 


b2741 


88 


1.78 


RNA polymerase sigma S factor 


ttk 


b3641 


134 


1.50 


Putative transcriptional regulator 


L : DNA 


replication. 


recombination and repair 


b0299 


b0299 


245 


1.13 


I S3 putative transposase 


dinG 


b0799 


114 


1.60 


ATP-dependent helicase. SOS 


dinl 


bl061 


61 


2.02 


Inhibits RecA-mediated self-cleavage. SOS 


dinP 


b0231 


82 


1.81 


Putative tRNA synthetase. SOS 


exo 


b2798 


78 


1.84 


5'-3' exonuclease. excision repair 


intA 


b2622 


177 


1.31 


Prophage CP4-57 integrase 


-* recA 


b2699 


51 


2.30 


DNA strand exchange and renaturation. SOS 


recN 


b2616 


237 


1.16 


Recombination and DNA repair. SOS 


sbmC 


b2009 


94 


1.74 


SbmC protein. SOS 


xthA 


bl749 


207 


1.23 


Exonuclease III 


METABOLISM 








C : Energy production and conversion 


Isocitrate lyase 


aceA 


b4015 


195 


1.25 


aceB 


b4014 


120 


1.56 


Malate synthase A 


aldA 


bl415 


67 


1.93 


Aldehyde dehydrogenase. NAD-linked 


atpA 
cyoA 


b3734 


186 


1.28 


Membrane-bound ATP synthase alpha-subunit 


b0432 


89 


1.78 


Cytochrome o ubiquinol oxidase subunit II 



50 



cyoC 

cyoD 

dctA 

fdhF 

fdoG 

glpD 

glpK 

gipQ 

-* mdh 

niJV 
pckA 
sdhB 
sdhD 
six A 
sucA 
sucB 
sucD 
xdhD 



b0430 26 3-59 Cytochrome o ubiquinol oxidase subunit III 

b0429 3 7.41 Cytochrome o ubiquinol oxidase subunit IV 

b3528 74 1 -89 Uptake of C4-dicarboxylic acids 

b4079 23 3.85 Subunit of formate dehydrogenase H. 

b3894 69 1.92 Formate dehydrogenase-O major subunit 

b3426 178 1.31 Sn-glycerol-3 -phosphate dehydrogenase 

b3926 92 1.75 Glycerol kinase 

b2239 46 2.50 Glycerol -3 -phosphate diesterase 

b3236 53 2.19 Malate dehydrogenase 

b2529 27 3 .4 1 Formation/repair of [Fe-S] clusters present in iron-sulfur proteins 

b3403 112 1 .62 Phosphoenolpyruvate carboxykinase 

b0724 1 73 1 -32 Succinate dehydrogenase. Iron sulfur protein 

b0722 131 1 52 Succinate dehydrogenase. Hydrophobic subunit 

b2 340 4 1 2.74 Phosphohistidine phosphatase affecting phosphorelay of ArcB 

b0726 7 6.54 2-oxoglutarate dehydrogenase (decarboxylase) 

b0727 83 1.81 2-oxoglutarate dehydrogenase (dihydrolipoyltranssuccinate) 

b0729 43 2.69 Succinyl-CoA synthetase. Alpha subunit 

b2881 115 1 .59 Putative dehydrogenase 



G : Carbohydrate transport and metabolism 



agp 
gcd 

gigs 

glpX 
lamB 
malE 
malF 
malS 
mglA 
mglB 
mrsA 
pgm 
-+rbsB 
rbsC 
rbsD 
sfsA 



bl002 
b0124 
b3049 
b3925 
b4036 
b4034 
b4033 
b3571 
b2149 
b2150 
b3176 
b0688 
b3751 
b3750 
b3748 
b0146 



85 
197 
168 
140 
36 
91 
87 
164 
215 
66 
111 
149 
48 
159 
154 
64 



1.80 
1.25 
1.35 
1.47 
2.94 
1.76 
1.79 
1.36 
1.21 
1.93 
1.62 
1.45 
2.41 
1.41 
1.42 
1.97 



E : Amino acid transport and metabolism 



ansB 
argC 
argR 
gadA 
idnD 
leuD 
metH 
nifS 
putP 
F 
H 



b2957 
b3958 
b3237 
b3517 
D4267 
b0071 
b4019 
b2530 
b!015 



65 
201 
137 
30 
175 
106 
72 
62 
83 



1.96 
1.24 
1.49 
3.15 
1.31 
1.66 
1.90 
1.98 
1.29 



Nucleotide transport and metabolism 
Coenzyme metabolism 



metK b2942 
pnuC b0751 
ubiE b3833 
J ; Lipid metabolism 



b0954 
b3846 
b0221 
b2344 
b0418 
b2585 
b0174 



110 
169 
220 

127 
21 
84 
181 
102 
239 
156 



1.63 
1.34 
1.20 



1.53 
4.18 
1.80 
1.29 
1.67 
1.15 
1.41 



Periplasmic glucose- 1 -phosphatase 
Glucose dehydrogenase 
Glycogen biosynthesis. rpoS dependent 
Unknown function in glycerol metabolism 
Maltose high-affinity receptor 
Periplasmic maltose-binding protein 
Part of maltose permease 
Alpha-amylase 

ATP-binding galactose-binding transport protein 
Galactose-binding transport protein 

Similar to phosphoglucomutases and phosphomannomutases 
Phosphoglucomutase 

D-ribose periplasmic binding protein, chemotaxis 
D-ribose high-affinity transport system 
D-ribose high-affinity transport system 
Regulatory for maltose metabolism 

Periplasmic L-asparaginase II 
N -acetyl -gamma-glutamylphosphate reductase 
Repressor of arg regulon 
Glutamate decarboxylase isozyme 
L-idonate dehydrogenase 
Isopropylmalate isomerase subunit 
Repressor of metE and metF 
Cysteine desulfurase 
Major sodium/proline symporter 
: none 

Methionine adenosyltransferase 
Required for NMN transport 

Ubiquinone/menaquinone biosynthesis methyltransferase 

Trans-2-decenoyl-ACP isomerase 
Fatty acid oxidation complex. 4-enzyme protein 
Acyl-coenzyme A dehydrogenase 
Transport of long-chain fatty acids 
Phosphatidylglycerophosphatase 
Phosphatidylserine synthase. Phospholipid synthesis 
Undecaprenyl pyrophosphate synthetase (peptidoglycan) 



fabA 
fadB 
fadE 
fadL 

PgP A 
pssA 
uppS 

Q : Secondary metabolites biosynthesis, transport and metabolism 

idnO D4266 148 1.45 5 -keto-D-gluconate 5 -reductase 

A b24 26 50 2.32 Short-chain dehydrogenases/reductases (SDR) family 



CELLULAR PROCESSES 
D: Cell division and chromosomal partitioning 

ftsL b0083 17 4.34 Cell division and growth 

sulA b0958 33 3.07 Inhibits cell division and ftsZ ring formation. SOS 

O : PosUtranslational modification, protein turnover, chaperones 

51 



dnaJ 
dnaK 
eco 
fkpA 
glnE 
htpG 
htpX 

msrA 
M+N : 
amiB 
ddg 
JhiA 
-*- fimA 
fimf 

htrl 

lepB 

mraW 

nlpB 

nlpC 

ompC 

ompG 

pspA 

pspB 

pspC 

pspD 

pspE 

-+pspF 

-+tatE 



bOOl 5 1 63 1 36 Chaperone with DnaK. Heat shock protein 

b0014 107 1.64 Chaperone Hsp70. Heat shock proteins 

b2209 9 6.17 Ecotin. Periplasmic serine protease inhibitor 

b3347 93 1 -74 FKBP-type peptidyl -prolyl cis-trans isomerase (rotamase) 

b3053 1 00 1 .69 Adenylylating enzyme for glutamine synthetase 

b0473 218 1 .20 Chaperone Hsp90. Heat shock protein 

b 1 829 76 1-88 Heat shock protein. Integral membrane protein 

b42 19 22 3.87 Peptide methionine sulfoxide reductase 
Cell envelope biogenesis and secretion 

b4169 243 1.13 N-acetylmuramoyl-l-alanine amidase II. Murein hydrolase 

b2378 1 24 1 .54 Putative heat shock protein 

b0229 71 1-91 Flagellar biosynthesis 

b4314 28 3.29 Major type 1 subunit fimbrin (pilin) 

b43 1 5 49 2 -33 Fimbrial protein 

b361 8 214 1-21 Involved in lipopolysaccharide biosynthesis 

b2568 121 1.56 Leader peptidase (signal peptidase I) 

b0082 238 1.15 Putative apolipoprotein 

b2477 191 1.26 Lipoprotein-34 

bl708 182 1.29 Lipoprotein 

b221 5 1 29 1 52 Outer membrane protein 1 b 

bl319 162 1.37 Outer membrane protein G 

bl 304 2 8.42 Phage shock protein. Inner membrane protein 

bi305 59 2.04 Phage shock protein 

bl 306 ] 2 5.58 Phage shock protein. Activates phage shock-protein expression 

bl307 11 5.61 Phage shock protein 

bl308 47 2.47 Phage shock protein 

bl303 211 1 22 psp operon transcriptional activator 

b0627 57 2. 12 Membrane translocation of folded periplasmic proteins 



P : Inorganic ion transport and metabolism 



bl216 234 1.16 Sodium-calcium/proton antiporter 

bl2 is 70 1.91 Accessory and regulatory protein for chaA 

bl874 24 3 74 Copper homeostasis protein 

b2425 74 1 .89 Thiosulfate binding protein 

b2424 136 1 .49 Thiosulfate transport system permease 

b0683 1 88 1 .27 Ferric iron uptake negative regulator 

b0763 99 1 .70 Molybdate-binding periplasmic protein. Permease 

b0764 223 1.19 Molybdate transport permease protein 

b0765 216 1 .20 ATP -binding component of molybdate transport 

b076 1 139 1 -47 Molybdate uptake regulatory protein 

bl646 32 3.10 Superoxide dismutase precursor (Cu-Zn) 

b3849 226 1.19 Potassium uptake 
' * 1 ; 

b39 ] 4 j 22.9 Suppresses toxic envelope protein effects. CpxA/R activated 

b2573 63 1 98 Extra-cytoplasmic Sigma-E factor 

b2572 55 2.15 Negative regulatory protein of sigma-E factor 

b25 7 \ 29 3.20 Negative regulatory protein of sigma-E factor 

bl743 31 3.13 Periplasmic protein related to spheroblast formation 



chaA 
chaC 
cutC 
cysP 
cysU 
fur 

modA 

modB 

modC 

modE 

sodC 

trkH 

T : Signal transduction mechanism 

cpxP 
rpoE 
rseA 
rseB 
*JE2L 



NOT CHARACTERIZED 



-+ycfJ 

-+ycJR 

yoaB 

yebE 

-» yqcC 

yhhY 

- yggN 

-+yneA 

ybeD 

ydcl 

yddL 

-* yccA 

-yfcX 

yjbO 

yrdD 

ybjF 

yihN 

ycJT 



bl 1 10 


5 


6.95 


bl 112 


8 


6.4 


bl809 


10 


6.03 


bl846 


13 


5.47 


b2792 


15 


4.45 


b3441 


16 


4.35 


b2958 


21 


4.1 


bl516 


25 


3.6 


b0631 


35 


2.97 


bl422 


37 


2.83 


bl472 


38 


2.82 


b0970 


39 


2.76 


b2341 


40 


2.75 


b4050 


44 


2.58 


b3283 


45 


2.54 


b0859 


56 


2.14 


b3874 


60 


2.03 


bins 


68 


1.93 



Similarity to Rickettsia 17 kda surface antigen 
Exported / Outer membrane protein ? 
Putative translation initiation inhibitor 
Similarity to an Y.enterocolitica protein 
Similarity to E.carotovora orfl exoenzyme 
Putative acetyltransferase 
Activated by RpoE 
Putative periplasmic binding protein 

Homology to one histine kinase sensor domain of M. grisea 
Putative transcriptional regulator LysR-type 
Putative outer membrane porin protein 

Putative carrier/transport membrane protein. Degraded by FtsH 

Putative fatty oxidation complex alpha subunit 

Similarity to a putative exported Y. pestis protein 

Putative DNA topoisomerase 

Putative 23S rRNA (uracil-5-)-methyltransferase 

Putative resistance protein (transport) 

Integral membrane protein 

52 



yeeF 

yflE 

yeeD 

yliH 

yfcM 

ybiX 

yfhF/nifA 

ygfQ 

ybhR 

ybdH 

yihR 

ydcT 

ygiS 

ybaZ 

ydaM 

tfaR 

yceL 

yheT 

yjdC 

ybiW 

ybiF 

ynal 

yceE 

yhdP 

ygjE 

csiE 
yfdE 
yeeE 

yegQ 

glcA 

yfdW 

yfeT 

y& K 

ydeW 



b2014 

b2577 

b2012 

b0836 

b2326 

b0804 

b2528 

b2884 

b0792 

b0599 

b3879 

bl441 

b3020 

b0454 

bl341 

bl373 

bl065 

b3353 

b4135 

b0823 

b0813 

bl330 

bl053 

bl657 

b3063 

b2535 

b2371 

b2013 

b2081 

b2975 

b2374 

b2427 

b3080 

bl512 



77 

86 

90 

95 

96 

97 

101 

113 

118 

130 

132 

138 

143 

150 

155 

157 

161 

167 

172 

174 

176 

180 

185 

196 

213 

217 

219 

221 

228 

229 

232 

233 

236 

242 



S : Function unknown 

bl228 
ycjL 
-*yghO 
yiaH 
-+yceP 
yqeC 
UygiB 
ycfT 
yhjJ 
yceB 
ybiX 

ygiQ 

yagV 
yoeA 
ybhQ 
ybcl 
ybbF 
ybgl 
yncH 
yfbM 
yjiM 

yjjo 

ychN 
ynaC 
ymfE 
yfcN 
yrbC 

yfdQ 

yfeY 
ygiM 
yhgA 

yhjQ 



bl228 


4 


bll04 


6 


b2981 


18 


b3561 


19 


bl060 


34 


b2876 


42 


b3037 


54 


blll5 


68 


b3527 


73 


bl063 


80 


b0804 


97 


b3015 


98 


b0289 


103 


bl995 


104 


b0791 


108 


b0527 


109 


b0524 


116 


b0710 


117 


bl455 


119 


b0681 


122 


b4335 


123 


b4189 


125 


b!219 


128 


bl373 


141 


bl 138 


142 


b2331 


144 


b3192 


145 


b2360 


147 


b2432 


151 


b3055 


152 


b3411 


153 


b3534 


158 



1 .84 Putative amino acid/amine transport protein 

1 .79 Putative transcriptional regulator LysR-type 

1 .77 Belongs to the UPF0033 family 

1 .73 Putative receptor 

1 .73 Putative transporting ATPase 

1.73 Putative enzyme 

1 .68 Putative regulator 

1 62 Belongs to YicO/YieG/YjcD family 

1 .58 Simlarity to £. coli YbhS, YhhJ and YhiG. IM protein 

1 .52 Putative oxidoreductase 

1.51 Putative aldose- 1 -epimerase 

1 .47 Putative ATP-binding component of a transport system 

1 .45 Putative transport periplasmic protein 

1 .44 Similarity to Cysteine methyltransferase 

1 41 Contains 1 GGDEF Dufl domain 

1 41 Phage lambda tail fiber gene homolog in prophage Rac 

1 .38 Belongs to the major fator family. Integral Membrane Protein 

1 .35 Belongs to the UPF00 1 7 family 

1 .33 Similarity to S. glaudescens TcmR 

1 .32 Putative formate acetyltransferase 

1 3 1 Putative transmembrane subunit 

1.3 Belongs to the UPF0003 family. Integral membrane protein 

1 .28 Putative transport protein 

1 .25 Putative transport protein 

1.21 Putative tartrate carrier 

1 .2 Stationary phase inducible protein 

1.2 Putative enzyme 

1.19 Putative transport system permease protein 

1.18 Putative peptidase (family U32) 

1.17 Putative permease 

1.17 Putative enzyme 

1.17 Belongs to the Sis family, RpiR subfamily 
1.16 Putative isomerase 

1.13 Putative transcriptional regulator. SorC family 

-7.04 Unknown 

6.70 Unknown 

4.31 Unknown 

4. 1 8 Unknown. Integral membrane protein 
3.06 Unknown 

2.70 Unknown 
2.15 Unknown 

1 .93 Unknown. Integral membrane protein 

1 .90 Unknown 

1.84 Unknown 

1.73 Unknown 

1.71 Unknown 
1 .67 Unknown 
1.66 Unknown 
1 .64 Unknown 
1 .63 Unknown 
1 .59 Unknown 
1.58 Unknown 
1.58 Unknown 
1.56 Unknown 
1 .54 Unknown 
1 .54 Unknown 
1.53 Unknown 
1 .47 Unknown 
1 .46 Unknown 
1 .45 Unknown 
1 .45 Unknown 
1 .45 Unknown 
1 .44 Unknown 
1 .43 Unknown 
1 .43 Unknown 
1.41 Unknown 
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yfcF 


b2301 


160 


yfcl 


b2305 


165 


yjiD 


b4326 


171 


yJbP 


b2275 


179 


yphB 


b2544 


184 


yJbN 


b2273 


187 


ylbH 


b0499 


189 


ybhM 


b0787 


190 


yrbL 


b3207 


192 


yjjy 


b4199 


193 


ynfA 


bl582 


194 


yajl 


b0412 


198 


yedl 


bl958 


200 


yafZ 


b0252 


202 


yjju 


b4377 


203 


yfliH 


b2561 


204 


vafN 


b0232 


205 


yrbE 


b3194 


206 


yfgC 


b2494 


208 


yJjQ 


b2633 


209 


ycaK 


b0901 


210 


yfeS 


b2420 


212 


b4250 


b4250 


222 


vbgA 


b0707 


224 


yeeA 


b2008 


225 




b2474 


230 


b2S94 


b2394 


231 




b2072 


235 


ybcJ 


b0528 


240 


yhiN 


b3492 


241 


ypfG 


b2466 


244 


ydiY 


bl722 


246 


yjjJ 


b4385 


247 


ycaP 


b0906 


248 




b2510 


249 



1 .39 Unknown 

1.35 Unknown 

1.34 Unknown 

1.30 Unknown 

1 .28 Unknown 

1.28 Unknown 

1.27 Unknown 

1 .26 Unknown. Integral membrane protein 

1 .26 Unknown 

1 .25 Unknown 

1 .25 Unknown 

1.25 Unknown 

1 .24 Unknown 

1 .24 Unknown 

1 .24 Unknown 

1 .24 Unknown 

1 .23 Unknown 

1 .23 Unknown 

1 .22 Unknown 

1 .22 Unknown 

1 .22 Unknown 

1 .22 Unknown 

1.19 Unknown 

1.19 Unknown 

1.19 Unknown 

1.17 Unknown 

1.17 I S 1 86 hypothetical protein 

1.16 Unknown 

1.14 Unknown 

1.14 Unknown 

1.13 Unknown 

1.13 Unknown 

1.12 Unknown 

1.11 Unknown 

1.10 Unknown 



5 



10 



15 



54 



Table 4. Genes under-expressed in E. coli TGI biofilm versus exponential growth 
phase. 

The genes found to be under-expressed at a significant level (P-value <0.05) are 
indicated. They have been classified according to the COGs functional categories 
annotation system. 

a : Gene names according to E. coli Colibri database. 

b : Gene names according to Blattner nomenclature. 

c : Rank position 1= the most repressed gene in E. coli biofilm. 

d : Ratio of gene expression in E. coli biofilm versus gene expression in planktonic 

cultures. 

e : Function description according to E. coli Colibri database. 



Genes 
a b 



Rank 



Bio/Exp 
d 



Function - description 



INFORMATION STORAGE AND PROCESSING 



def 

frr 

prfA 

rbfA 

rbn 

rimJ 

rpmG 

rpsV 

serS 

thrS 



b3287 
b0172 
bl211 
b3167 
b3886 
bl066 
b3636 
bl480 
b0893 
bl719 



K: Transcription 

gcvR b2479 
malT b3418 
osmE bl739 
oxyR b3961 
xylR b3569 



dnaG 

holA 

hupB 

intE 

nudG 

uvrD 

xerC 



b3066 
b0640 
b0440 
bll40 
bl759 
b3813 
b3811 



62 


0.67 


Peptide deformylase 


81 


0.70 


Ribosome releasing factor 


166 


0.84 


Peptide chain release factor RF-1 


40 


0.61 


Ribosome-binding factor A 


176 


0.87 


tRNA processing exoribonuclease BN 


88 


0.72 


Acetylation of 30S ribosomal subunit protein S5 


79 


0.69 


5 OS ribosomal subunit protein L33 


161 


0.83 


3 OS ribosomal subunit protein S22 


96 


0.73 


Serine tRNA synthetase 


63 


0.67 


Threonine tRNA synthetase 


69 


0.67 


Transcriptional regulation of gcv operon 


119 


0.77 


Positive regulator of ma/ regulon 


66 


0.67 


Osmotically inducible lipoprotein E 


108 


0.75 


Activator of hydrogen peroxide-inducible genes 


170 


0.86 


Putative regulator of xyl operon 


recombination and repair 


187 


0.92 


DNA primase 


134 


0.79 


DNA polymerase III delta subunit 


41 


0.61 


DNA-binding protein HU-beta 


164 


0.84 


Prophage e 1 4 integrase 


174 


0.87 


CTP pyrophosphohydrolase 


183 


0.91 


DNA-dependent ATPase I and helicase II 


109 


0.76 


Site-specific recombinase 



METABOLISM 

C : Energy production and conversion 

adhE bl241 1 0.23 

aldB b3588 180 0.89 

cydA b0733 93 0.73 

cydB b0734 127 0.78 

dcuC b0621 106 0.75 

fumB b4122 145 0.81 

icdA bl 136 136 0.80 

pjlB b0903 29 0.56 

pta b2297 120 0.77 



Iron-dependent alcohol dehydrogenase 
Aldehyde dehydrogenase 

Cytochrome d terminal oxidase. Polypeptide subunit I 
Cytochrome d terminal oxidase Polypeptide subunit II 
Transport of dicarboxylates 
Fumarase B 

Isocitrate dehydrogenase 
Formate acetyltransferase 1 
Phosphotransacetylase 



G : Carbohydrate transport and metabolism 

bglX b2132 159 0.83 Beta-D-glucoside glucohydrolase 

cmr b0842 158 0.83 Proton motive force efflux pump 
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cpsG 

err 

eno 

jba 

/bp 

fruA 

fruB 

fruK 

gapA 

gpmA 

manY 

nagZ 

pjkA 

Pgk 
ptsl 
sgaH 
sgaU 
shiA 
torT 
tpiA 



b2048 

b2417 

b2779 

b2925 

b4232 

b2167 

b2169 

b2168 

bl779 

b0755 

bl818 

bl 107 

b3916 

b2926 

b2416 

b4196 

b4197 

bl981 

b0994 

b3919 



135 
70 
27 
48 
103 
61 
71 
33 
4 

36 

18 

94 

56 

53 

47 

90 

146 

144 

38 

45 



0.80 

0.68 

0.55 

0.63 

0.75 

0.66 

0.68 

0.57 

0.32 

0.60 

0.47 

0.73 

0.66 

0.65 

0.62 

0.72 

0.82 

0.81 

0.60 

0.62 



E : Amino acid transport and metabolism 



arcC 


b0521 


80 


0.69 


argF 


b0273 


102 


0.74 


aroG 


b0754 


11 


0.43 


aroH 


bl704 


143 


0.81 


asnB 


b0674 


148 


0.82 


edd 


bl851 


162 


0.84 


ggt 


b3447 


186 


0.92 


glnA 


b3870 


58 


0.66 


glnB 


b2553 


20 


0.49 


hisB 


b2022 


25 


0.53 


hisC 


b2021 


17 


0.47 


hisG 


b2019 


165 


0.84 


hisl 


b2026 


44 


0.62 


ilvL 


b3766 


14 


0.45 


oppA 


bl243 


42 


0.61 


oppB 


bl244 


74 


0.68 


oppC 


bl245 


129 


0.78 


oppD 


bl246 


65 


0.67 


oppF 


bl247 


172 


0.86 


pepQ 


b3847 


101 


0.74 


trpA 


bl260 


72 


0.68 


trpB 


bl261 


39 


0.60 


F : Nucleotide transport and metabolism 


cyaA 


b3806 


43 


0.61 


hpt 


b0125 


59 


0.66 


tdk 


bl238 


113 


0.77 


H : Coenzyme metabolism 


0.77 


bioH 


b3412 


118 


dxs 


b0420 


150 


0.82 


folC 


b2315 


179 


0.89 


mobA 


b3857 


49 


0.63 


tbpA 


b0068 


76 


0.69 



Phosphomannomutase 
Glucose-specific IIA component 
Enolase. Glycolysis 

Fructose-bisphosphate aldolase. Glycolysis 
Fructose-bisphosphatase 
Fructose-specific transport protein 
Fructose-specific HA/fipr component 
Fructose- 1 -phosphate kinase 
Glyceraldehyde-3 -phosphate dehydrogenase A. 
Phosphoglyceromutase 1 . Glycolysis 
PTS enzyme IIC. Mannose-specific 
Beta-hexosaminidase. Cell wall synthesis 
6-phosphofructokinase I. Glycolysis 
Phosphoglycerate kinase. Glycolysis 
PEP-protein phosphotransferase system enzyme I 
Hexulose-6-phosphate synthase 
Hexulose-6-phosphate isomerase 
Putative shikimate transport protein 
Part of regulation of tor operon. 
Triosephosphate isomerase. Glycolysis 

Putative carbamate kinase. Arginine degradation 
Ornithine carbamoyltransferase 2 
DAHP synthetase. Aromatic amino acids biosynthesis 
DAHP synthetase. Aromatic amino acids biosynthesis 
Asparagine synthetase B 
6-phosphogluconate dehydratase 
Gamma-glutamyl transpeptidase 
Glutamine synthetase 

Regulatory protein P-II for glutamine synthetase 

Imidazole glycerolphosphate dehydratase 

Histidinol-phosphate aminotransferase 

ATP phosphoribosyltransferase 

Phosphoribosyl-ATP pyrophosphatase 

ilvGEDA operon leader peptide 

Oligopeptide transport. Periplasmic binding protein 

Oligopeptide transport. Permease protein 

Oligopeptide transport. Permease protein 

ATP -binding protein of oligopeptide transport system 

ATP-binding protein of oligopeptide transport system 

Proline dipeptidase 

Tryptophan synthase, alpha protein 

Tryptophan synthase, beta protein 

Adenylate cyclase 
Purine salvage 
Thymidine kinase 

Biotin biosynthesis 

l-deoxyxylulose-5-phosphate synthase. Flavoprotein 

Dihydrofolate synthetase 

Molybdopterin 

Thiamin-binding periplasmic protein 



/ ; Lipid metabolism : none 
Q : Secondary metabolites biosynthesis, transport and metabolism 

pmbA b4235 149 0.82 Maturation of antibiotic MccB17 



CELLULAR PROCESSES 
D: Cell division and chromosomal partitioning 

zip A b24 12 126 0.78 Cell division protein involved in FtsZ ring 

O : Post-translational modification, protein turnover, chaperones 

clpA b0882 1 33 0.79 ATP-binding component of serine protease 

JkpB b0028 50 0.64 Peptidyl -prolyl cis-trans isomerase (a rotamase) 

ppiB b0525 52 0.64 Peptidyl-prolyl cis-trans isomerase B (rotamase B) 

M+7V ; Cell envelope biogenesis and secretion 

cpsB b2049 91 0.72 Colanic acid biosynthesis 

dacA b0632 34 0.58 D-alanyl-D-alanine carboxypeptidase 
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exbB 
exbD 
Ipp 
IpxD 
pbpG 
sohB 
yJbE 



b3006 16 0.46 Uptake of enterochelin 

b3005 15 0.46 Uptake of enterochelin 

bi677 24 0.53 Murein lipoprotein 

bO 1 79 84 0.71 Third step of endotoxin (lipid A) synthesis 

b2134 1 28 0.78 Penicillin-binding protein 7 

bl272 60 0.66 Putative protease 

b2253 64 0.67 Putative enzyme 

tmnvnnrt and metabolism 

Iron storage and mobility [2Fe-2S] 
Ferrous iron transport protein A 
ferrous iron transport protein B 
Ferric hydroxamate transport 
Formate transporter 

Large subunit of phenylpropionate dioxygenase 



P : Inorganic ion transport and metabolism 

bfd b3337 10 0.41 

feoA b3408 12 0.43 

feoB b3409 57 0.66 

JhuF b4367 31 0.57 

focA b0904 2 0.30 

hcaAl b2538 86 0.72 

T: Signal transduction mechanism : none 



NOT CHARACTERIZED 



yncE 

yhiX 

yfiD 

yodB 

ynfK 

ycgT 

yebL 

yeeX 

ykgM 

ybaO 

etp 

y& H 

yqhC 

yhcJ 

yeiA 

ybgS 

yhjW 

ydgF 

ybcC 

ybjW 

yjiL 

ctsA 

ydjG 

yeaU 

y&u 

yejO 
yeiC 
ynjE 
yjbC 
yadF 
essD 
ynel 
perM 

yjjP 

yhdX 

yihO 

ycgS 

yqhH 

b0878 

ygfH 

yegH 
ydiF 
yeeZ 
ydhM 
ydjK 



bl452 


5 


0.34 


b3516 


8 


0.39 


b2579 


13 


0.44 


bl974 


21 


0.51 


bl593 


22 


0.52 


bl200 


26 


0.53 


b!857 


35 


0.59 


b2007 


37 


0.60 


b0296 


51 


0.64 


b0447 


54 


0.65 


b0982 


55 


0.66 


b3074 


67 


0.67 


b3010 


68 


0.67 


b3223 


73 


0.68 


b2147 


77 


0.69 


b0753 


82 


0.71 


b3380 


85 


0.71 


M600 


98 


0.74 


b0539 


99 


0.74 


b0873 


100 


0.74 


b4334 


105 


0.75 


b0598 


112 


0.76 



bl771 

bl800 

b3089 

b2190 

b2166 

b!757 

b4022 

b0126 

b0554 

b!525 

b2493 

b4S64 

b3269 

b3876 

blJ99 

b3014 

b0878 

b2920 

b2063 

bl694 

b2016 

bl649 

bJ775 



114 

116 

117 

121 

124 

130 

142 

147 

151 

152 

156 

157 

160 

167 

169 

173 

175 

177 

178 

181 

182 

185 

188 



S : Function unknown 

b3007 b3007 3 

yfjF b2618 6 

ynaK bl365 7 

b3004 b3004 9 



0.77 
0.77 
0.77 
0.78 
0.78 
0.79 
0.81 
0.82 
0.82 
0.82 
0.83 
0.83 
0.83 
0.85 
0.85 
0.86 
0.87 
0.88 
0.88 
0.89 
0.90 
0.91 
0.93 

0.30 
0.35 
0.37 
0.39 



i only 

Putative receptor 

Putative AraC-type regulatory protein 

Putative formate acetyltransferase 

Putative cytochrome 

Putative dethiobiotin synthetase 

Putative dihydroxyacetone kinase 

Putative high-affinity zinc uptake system protein 

Putative alpha helix protein 

Putative ribosomal protein 

Putative Lrp-like transcriptional regulator 

Putative protein-tyrosine-phosphatase 

Putative tRNA synthetase 

Putative AraC-type regulatory protein 

Putative enzyme 

Putative oxidoreductase 

Putative homeobox protein 

Putative mutase 

Possible chaperone 

Putative exonuclease 

Putative prismane 

Putative enzyme 

Putative carbon starvation protein 
Hypothetical oxidoreductase 
Putative tartrate dehydrogenase 
Putative symporter protein 

Putative ATP-binding component of a transport system 

Putative sugar kinase 

Putative thiosulfate sulfur transferase 

Putative pseudo-uridine synthase 

Putative carbonic anhydrase 

Lysis protein homolog to lambdoid prophage DLP12 

Putative aldehyde dehydrogenase 

Putative permease 

Putative structural protein 

Putative transport system permease protein 

Putative permease 

Putative dihydroxyacetone kinase 

Putative lipoprotein 

Putative membrane protein 

Putative coenzyme A transferase 

Putative transport protein 

Putative enzyme 

Putative enzyme of sugar metabolism 
Putative transcriptional regulator 
Putative transport protein 

Unknown 
Unknown 
Unknown 
Unknown 
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yodA 


bl973 




ymfA 






yjgO 


U A~i C C 


Zo 


yeaQ 


bl795 


30 


yail 


LAI 0*7 


j J. 


yceD 


bl088 


46 


bOJOO 


LA 1 AA 

DOIUU 


/ J 


ydcN 


U 1 A 1 A 

bl4J4 


/o 


ygiH 


b3059 


83 


ytjl 


042 1_> 


R7 
o / 


ymfO 


Ui 1 CI 


£Q 

07 


ytJH 


D4z 1 1 




ynhA 


b!679 


AC 


ybaM 




Q"7 


ynfB 


bl 583 


1 f\A 

104 


ydgA 


l_ 1 /C 1 A 

bl614 


1 tV7 
1 U / 


yggJ 


b2946 


1 1 Pk 

1 1U 


yadS 


b0157 


1 1 1 


yfeK 


b2419 


1 I J 


ycgR 


bl 194 


122 


yfdS 


b2362 


1 oi 


yadH 


b0128 


1 oc 


yhhZ 


b3442 


1 J 1 


yhU 


b3488 


1 71 


ycbJ 


b0919 


i n 


elaA 


b2267 


1 Jo 


ybhN 


bl788 


1 io 
1 


ydgH 


bl604 


i Art 
14U 


yjjR 


b2634 


141 


ynfC 


Li roc 
Dl Doj 


1 J J 


yhgG 


b3410 


154 


ydjZ 


b!752 


155 


ydjY 


bl751 


163 


yhbV 


b3159 


168 


b2791 


b2791 


171 


yn/B 


b!754 


184 



0.48 

0.52 

0.56 

0.57 

0.57 

0.62 

0.69 

0.69 

0.71 

0.72 

0.72 

0.73 

0.73 

0.73 

0.75 

0.75 

0.76 

0.76 

0.77 

0.78 

0.78 

0.78 

0.79 

0.79 

0.81 

0.81 

0.81 

0.81 

0.81 

0.82 

0.82 

0.83 

0.84 

0.85 

0.86 

0.91 



Unknown 

Unknown 

Unknown 

Unknown 

Unknown 

Unknown 

Unknown 

Unknown 

Unknown 

Unknown 

Unknown 

Unknown 

Unknown 

Unknown 

Unknown 

Unknown 

Unknown 

Unknown 

Unknown 

Unknown 

Unknown 

Unknown 

Unknown 

Unknown 

Unknown 

Unknown 

Unknown 

Unknown 

Unknown 

Unknown 

Unknown 

Unknown 

Unknown 

Unknown 

Unknown 

Unknown 



10 



15 



20 
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Table 5. Genes over-expressed (^2) in E. coli TGI biofilm versus both exponential 
and stationary growth phase. 



a : Gene names according to E. coli Colibri database, 
b : Gene names according to Blattner nomenclature. 

c : Ratio of gene expression in E. coli biofilm versus exponential growth phase. 

d : Ratio of gene expression in E. coli biofilm versus stationary growth phase. 

e : Function description according to E. coli Colibri database. 

Arrow: mutants where biofilm formation were reduced compared to wt 

The genes have been classified according to the COGs functional categories annotation 

system used by the NCBI. 



Genes Bio/Exp Bio/Sta 
a b c d_ 



Function - description 



INFORMATION STORAGE AND PROCESSING 
J : Translation, ribosomal structure and metabolism 

rne b!084 2.06 3.57 RNase E 



K : Transcription , . 

-> IctR b3604 4-76 8.07 Regulatory protein for L-Lactate dehydrogenase genes 

L : DNA replication, recombination and repair 

dinJ b!061 2.02 2.92 Inhibits RecA-mediated self-cleavage. SOS 



METABOLISM 

C : Energy production and conversion 

glpQ b2239 2.50 2.15 Glycerol-3-phosphate diesterase 

-*mdh b3236 2 19 3.68 Malate dehydrogenase 

sixA b2340 2.74 3.24 Phosphohistidine phosphatase affecting phosphorelay of ArcB 

G : Carbohydrate transport and metabolism 

lamB b4036 2.94 3.73 Maltose high-affinity receptor 

-» rbsB b375 1 2.41 3. 83 D-ribose periplasmic binding protein, chemotaxis 

E : Amino acid transport and metabolism 

gadA b3517 3.15 4 84 Glutamate decarboxylase isozyme 

F : Nucleotide transport and metabolism : none 
H : Coenzyme metabolism : none 
J : Lipid metabolism : none 

Q : Secondary metabolites biosynthesis, transport and met abolism : none 

CELLULAR PROCESSES 

D: Cell division and chromosomal partitioning : none 

O : Post-translational modification, protein turnover. Chaperones : none 

pspA bl 304 8.42 3.86 Phage shock protein. Inner membrane protein 

pspB bl305 2.04 3.44 Phage shock protein 

pspC bl 306 5-58 2.55 Phage shock protein. Activates phage shock-protein expression 

pspD bl307 5.61 2.48 Phage shock protein 

_ t a i E b0627 2.12 5 .02 Membrane translocation of folded periplasms proteins 

P : Inorganic ion transport and metabolism : none 

T : Signal transduction mechanism : t 

-» cpxP b3914 22.9 13.15 Suppresses toxic envelope protein effects. CpxA/R activated 

rseA b2572 2. 1 5 2.28 Negative regulatory protein of sigma-E factor 

rpoE b2573 1.98 5.86 Extracytoplasmic sigma E factor 

_ t>1743 3J3 5.02 Periplasmic protein related to spheroblast formation 
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NOT CHARACTERIZED 

R : Function unknown : General prediction only 



yebE bl846 5.47 

-+yqcC b2792 4.45 

-+yfcX b2341 2.75 

yjbO b4050 2.58 
S : Function unknown 

-+yceP bl060 3.06 

-veiB b3037 2.15 



3.14 Similarity to an Y.enterocolitica protein 

2.31 Similarity to E.carotovora orfl exoenzyme 

4.83 Putative fatty oxidation complex alpha subunit 

5.66 Similarity to a putative exported Y. pestis protein 

6.46 Unknown 

2.09 Unknown 



Table 6. Inactivation of the genes described in the study and TGlg/p strain 
construction: primers used in the linear DNA, 3 step PCR inactivation protocol. 

5 a : Gene names according to E. coli Colibri database. 
b : nomenclature according to Institute Pasteur database. 



* : Genes inactived by a removable fit kanamycin cassette. 



Target 
genes a 


K 

Primers name 


SEQ 
ID 
NO 


k r llllcl a aCljUCiivi/ 


cpxP* 


CpxP.AI .500-5 
CpxP.BI .500-3 
CpxP.B2.frtL-5 
CdxP A2 frtL-3 
CpxP.ext-5 
CpxP.ext-3 


1 
2 
3 
4 

5 
6 


5'CGGCATCATTACGTCAAGCAAAAG3' 

5'GCGCCAGCGCCGCGAGGGACTCAG3' „™. 
5'GAACTTCGGAATAGGAACTAATAGTAAACCCTG I I I I CCTTGCC3 
AGC A&CTCCAGCCT ACACCATCATTTGCTCCCAAAATCTTTCi' 

5'CCCGAATTCCGAAGTGC 1 1 1 1 AATGTGTCG3' 

5'CGCCTGGATCTGTCATCGGTG3* _ _ 


cpxR* 


CpxR.AI ..500-5. 
CpxR.BI .500-3 
CpxR.B2.frtL-5 
CpxR. A2 .frtL-3 
CpxR.ext-5 
CpxR.ext-3 


7 
8 
9 

10 
11 
12 


5'CGTGAGTTGCTACTACTCAATAG3' 
5'GCCGGACGAATCAGATAAAG3' 

5'GAACTTCGGAATAGGAACTAAGGTTTAAAACCTTGCGTGGTC3 
5'GAAGCAGCTCCAGCCTACACGAAATTACGTCATCAGACGTCGC3' 

f-i^ a -i — r/~« att^ATA AATArTrf! 1 ^' 

5 GATTGA1 1 OA 1 AAA I av^ i ^oo 

5'CAAACAGTAAGTTAATGAAATC3' . 


cutC 


CutC. A1. 500-5 

CutC.BI. 500-3 

CutC.B2.GBL-5 

CutC.A2.GBL-3 

CutC.ext-5 

CutC.ext-3 


13 
14 
15 

16 


5'CACTATTGCATCAGAAGCGG3' 
5'CCTTTCTGGTTCGAAAAGTGG3' 

5-CTTCAC6A&GCAGACCTCA6CGCCT&ATTTTTACCGTTGCATCAT&TCGC3 
5'GATTTTGAGACACAACGTGGCTTTCA 1 1 1 1 1 ACTCCTJ AATT ACGCCGACZ 




17 
18 


5'GGAATACCTTACATTGATGA3' 

5'CTTTAGATGCCTTTAATTTAG3' ; 


cyoC 


CyoC .A1 .500-5 

CyoC.BI .500-3 

CyoC.B2.GBL-5 

CyoC.A2.GBL-3 

CyoC.ext-5 

CyoC.ext-3 


19 
20 
21 
22 

| 23 
I 24 


5'CCATGCTGATGATTGCAGCC3' 

5'CCGACGCCACAACCAGTGAC3' _ A ^, 
5-CTTCACGAGGCAGACCTCAGCGCCTAATGAGTCATTCTACCGATCAC3 

5'GATTTTGAGACACAACGTGGCTTTCATTTTTCAGCCCTGCCTTAGTAATC3' 
5'CAGGGATGACCTACTGGTGG3' 

5-GGATTCGCGCCAAACCACAG3' 


dinl 


Dinl.AI .500-5 

Dinl.BI .500-3 

Dinl.B2.GBL-5 

Dinl.A2.GBL-3 

Dinl.ext-5 

Dinl.ext-3 


' 25 
26 
27 
28 

29 
30 


5'GTTTAACCGCAACCATATGC3' 
5'CGATTCCTGCTTCTAATATC3' 

5'CTTCACGAGGCAGACCTCAGCGCCTAATATGCAGTGAl 1 I I M I iGCC3 
5'GATTTTGAGACACAACGTGGCTTTCATAATAGCCCCCTGTTGAA3' 

5'CCTGACTGCGCTGAAAGTCG3' 

5'GACGCCGATACTCGTTTACC3' ._. ___ 
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\ecO 1 


EcO.AI .500-5 : 
-cO.B1.500-o 
EcO.B2.GBL-5 : 
EcO.A2.GBL-3 : 
EcO.ext-5 
EcO.ext-3 


51 ; 

33 - 

34 i 

35 
36 


j'CGCCGCGTTGCAGAATGTTG3' 

D CCGGATG 1 bbL»0 1 A 1 1 OM 1 M/AOMUOUO 

5'CTTCACGAGGCAGACCTCAGCGCCCAACGCGGTAGTTCGCTAAACTGCCG3' 
yGATTTT&AGACACAACGTG&CTTTCCATTTTTTT6CTTTCCTTC3- 

5'ATTTTTGAAATTAACGCTCG3' 
5'GTTGAAACCGCAACCCGTTC3' 


\fadB 


FadB.A1 .500-5 

FadB.BI .oOO-o 

FadB.B2.GBL-5 

FadB.A2.GBL-3 

FadB.ext-5 

FadB.ext-3 


37 
oo 
39 
40 

41 
42 


5'GATCACTTCCACATCTTCAG3' 
5 GATTTCA 1 1 1 1 1 AAA 1 oowo 

S'CTTCACGAGGCAGACCTCAGCGCCTAAGGAGTCACAATGGAACAGGTTGS" 
5'GATTTTGAGACACAACGTGGCTTTCATGTCAGTCTCCTGAATCC3' 

5'CTGGCCTCAATACCCAGTTG3' 

5'GTTTACTGGATCAAACGCCGGACGC3' . _ . 


YfdhF 


FdhF.AI .500-5 

FdhF. B1. 500-3 

FdhF.B2.GBL-5 

FdhF.A2.GBL-3 

FdhF.ext-5 

FdhF.ext-3 


43 

A A 

44 
45 
46 

47 
48 


5'GTCTGCAAACGCTCAACGAC3' 
5 GTCGTTCTCCAGATCTTUOoo 

5'CTTCACGAGGCAGACCTCAGCGCCTAATACCGTCCI 1 ICTACAG3' 
5'GATTTTGAGACACAACGTGGCTTTCCATCGGTCTCGCTCCAGTTAATC3' 

5'GCCGCTGTTTGACGGTGGAC3' 

5'CGCCCAGTACTCGGAATAAC3' 


\gadA 


GadA.AI .500-5 

GadA.BI .500-3 

GadA.B2.GBL-5 

GadA.A2.GBL-3 

GadA.ext-5 

GadA.ext-3 


49 
50 
51 
52 

53 
54 


5'CCTTTGAACCGTTGGGGCTG3' 
5 CTTATCTACTCGAA 1 1 1 GGC3 

5'CTTCACGAGGCAGACCTCAGCGCCGATAACATAACGTTGTAAAAAC3 
$'G ATTTTG AG ACAC A ACGTGGCTTTCATTTCG A ACTCCTT A A ATTTATTTG3' 

5'GTTGCGCGGAGATGAAAATG3' 

S'CATGAAGATTTAATGCCTCCS'- — 


UctR 


LctR.AI .500-5 

LctR.BI .500-3 

LctR.B2.GBL-5 

LctR.A2.GBL-3 

LctR.ext-5 

LctR.ext-3 


55 
56 
57 
58 

59 
60 


5'GCACTGCTCTCGATTGTCTG3* 
5 GGGCCGCTCATACCTGAA Ha J 

5'CTTCACGAGGCAGACCTCAGCGCCTGATTATTTCCGCAGCCAGCGAT3 
5'GATTTTGAGACACAACGTGGCTTTCCATTAAGGAATCATCCACGTTAAG3' 

5'GGTGGCGCGCTGTATGAGTG3' 

!VCOTAAATCATGTGGACGACC3' _ 


malM 
to 

malG 


MalMG.AI .500-5 

MalMG.BI. 500-3 

MalMG.B2.GBL-5 

MalMG.A2.GBL-3 

MalMG.ext-5 

MalMG.ext-3 


61 
62 
63 
64 

65 
66 


5'ACGACTCCAGCGGATCGCGCGGCAAC3' 
5 CAATAGTGGAAT1 \j 1 1 1 1 1 a I 

5'CTTCACGAGGCAGACCTCAGCGCCTAGCCCTTGTGGAGGTTCCTGCAAT3' 
5'GATTTTG AG AC AC A ACGTGGCTTTCATTTCTC A TCCTTGTTTTATC3' 

5'GGTTTTCGACCAGTTTGACTAAG3' 

5'CGTTGGTGCTGTTAGCACTGTATC3' _ _ _ _ _ 


mdh 


Mdh.AI .500-5 

Mdh. B1 .500-3 

Mdh.B2.GBL-5 

Mdh.A2.GBL-3 

Mdh.ext-5 

Mdh.ext-3 


67 
68 
69 
70 

71 
72 


5'GCATAAGTCACCCGATATGGTGG3' 

5 CTCGCTGGGCGAAU 1 vjA i Ij<j<j.3 _ 

S'CTTCACGAGGCAGACCTCAGCGCCTAATTGATTAGCGGATAATAAAAAACS 

5-GATTTTGAGACACAACGTGGCTTTCATCCTAAACTCCTTATTATATTG3' 
5'CTGCAACGCGGCGACGATTTC3' 

5'GGCAAAACTTCCTCCAAACCG3' - 


\nifS 


NifS.AI .500-5 
| NifS. B1. 500-3 
! NifS.B2.GBL-5 

NifS.A2.GBL-3 

NifS.ext-5 

NifS.ext-3 


73 
74 
75 
76 

77 
78 


5'CCTTTCTTATCTGGAACAAC3' 
5'CGCCCAGACGCAGGCCAAAC3' 

5'CTTCACGAGGCAGACCTCAGCGCCTAATCGGTATCGGAATCAG3 
5'GATTTTGAGACACAACGTGGCTTTCATTGCTCTATAAACTCCGTACATCAC3' 

5'CATGAGACTGACATCTAAAG3' 

5'CTTCTTTTACGAAGTCCAGC3' - 
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nifU 


NifU.AI .oOu-o 


70 

f y 


5 CATCGCAAAAGAAGAGATGG3 


NlfU.BI .500-3 


Q A 
80 


5 CTCAGCGCCTGGGTATCGAG3 




NifU B2 GBL-5 


81 


R'rTTrArGAGGCAGACCTCAGCGCCTAAGAGTTGAGGTTTGGTTATGS' 




Mifi 1 A9 f^RI 

IN 1 IU .r\£- . O D L.-0 


89 


5 GATTTTG AG AC AC A ACt7Tt7(-7C 1 1 I CA l 1 /\ 1 Ann i icilci t?M i i o«3 




NifU ext-5 








NifU.ext-3 


83 


5'GGTGCGCTGTATGTACGTCG3* 






84 


5'GGTTAATGGTTGCAGATTGC3' 


nlpE 


NIpE.AI .500-5 


00 


5 ACATGTTGCTATTCCCGATCjo 


NIpE.BI .500-3 


OD 


5 GCAGTGTGGGCGAAGGAGAC3 




NlnF B2 GBL-5 


87 


^•rArGAGGCAGACCTCAGCGCTAACCCGTCTTGAGACAGAAACAAAC3' 




MlnP A9 f^RI -1 




5 TTGA& AC AC A ACG\&bC\\\C A l CCA 1 ICC I loi i i i iai i ccct?o 




IN 1 JJ C . t5 A l-O 








NlnE ext-3 


89 


57^TCTTTCCGTCTGGTATCTG3' j 






c i , nA^:TCGCCAGATGTGCTCAC3 , 


pspA to 


PspAE.AI .500-5 


91 


5 CCCGAGCTCACCATCATCGGTGCCGTAGOGAC33 




PspAE.BI. 500-3 


r\o 

92 


5 , GATAATCAATTACCGAAAAGCCATC3 , 


pspE 


pcnAF B2 GBL-5 


93 


^'rTTrArGAGGCAGACCTCAGCGCCTAAAAGAATTCACCATGAGCGGS' 




PcnAF A9 f^RI 


Q4 


cy. . — Ar-Ar>Af*AA r r~rr r r~\ — i — vrr a T a a T/CTT^T/TTrTTfiATTTrTA^' 
5 SATTTTGAGACACA ACt7Tt7t7C i 1 1 CCA I A A 1 1? 1 1 i cc \ c » i t7A i » i o i t70 




PcnAF pxt-fi 






PsdAE ext-3 


95 


5'CAGTTCACCGTACTCAATCACGC3 , 




Qfi 


^'CGAGTrGCTGAATATCCTGCCACTCCS* 


rbsB 


RbsB.AI .500-5 


97 


5 GGTATTGGTCGTCCGCTGGG3 


RbsB. B1 .500-3 


98 


Am*. Am^mm> Am*, al Am* ■ ■ *m\ Am\ a^\ ^T^T" A O ' 

5 CGCTCACGTTGCGCTTCCAC3 




DKcR R9 f5RI 
l\L)bD. DZ.VJDL'J 




c»r>TTr AmAnnPAnArPTCAGCGCCTAGTTTTAATCAGGTTGTATG3' 




nu n D AO D 1 Q 


1 nn 
I uu 


5GAI 1 1 \ GA&ACACA ACGTGGCTTTCATATTC A AC? A IC? ICC i C7 1 Afco 




DKcR ovt.R 
KDSD.oXl-O 






RhcR Pxt-3 


101 


5 , GGCGTGACCATGGTTTATAC3' 




*i no 


^•r^A A<^TTPf^Pr^AGnCGGAGCC3' 

O OMMO 1 I UuvyOnu \j\jr^<\^j \*s\-*\*f _ _ _ _ 


rpoE 


RpoE.AI .500-5 


103 


5 GACCTGATGCTGGTCAGCCAGGCGTAG3 




RpoE. B1. 500-3 


104 


S'CGCTTCAGAAGGTACTCCCAGS' 






105 


R•rTTrAmAGGrAGACCTCAGCGCCCAGGCGTTGACGATAGCGGG3 , 





n nn r AO r^D 1 0 




5 G ATTTTG AG AC AC A AC&Tt7t7CT \ \ CA 1 CCC7Afcrt7 i A a At7 l c i cootno 




Kpot.exi-o 






RnoF ext-3 


107 


5'GAACCTTCCGTTACCGGGCCTTTAC3 , 






i no 
1 Uo 


^ ' n p A A C ATTG C ATT AATG C G AC G AC 3' 


rseA* 


RseA.AI .500-5 


109 


5 GCATAAAGTGGCGAGTCTGG3 


RseA.BI. 500-3 


110 


5'GTAATTTCGATTCGGTGTCC3' 




Deo A R9 frtl 


111 

1 1 1 


a APTTPnn A ATAGGAACTAAGTTTGAGCAGGCGCAAACCCAGC3' 




D _ _ a AO f r~fl Q 

KSeA.AZ.TrtL-o 




5'GAAGCAGCTCCAGCCTACACCATGCCTAATACCC \ \ A \ CC3 




r\bCM.CAl 3 








RseA ext-3 


113 


S'GGTCCTGGTTGAACGGGTCCS' 






R , GTTCCAGCGTTTCACCATCG3 , 


rseB 


RseB.AI .500-5 


a a r- 

115 


5 CCATTTCGATATCTCTTCACo 


RseB. B1. 500-3 


116 


S'CGTCCTCGCATTTGTTATGCS' 




PcoR R9 f^Rl 


117 


R , PTTPAPr;AnGPAGACCTCAGCGCCATGATCAAAGAGTGGGCTAC3 , 




Rseb.Az.VjDL-o 


i 10 


5 GATTTTGAGACACAACGTGGCTTTCATTACTGCt7A » it7Ct7i icc^ 




KseD.exi-o 








RcpR pvt-3 

IXoCU .CAI \J 


119 


5 , CTTAATCCGTGACTCAATGC3 f 




1 on 


*Vf^AAATGTTCATACCGTATGG3 , 

^ 1 I 1 \ I r^www i ■ v-'v^w „ 


sixA 


SixA.AI .500-5 


4 0 h 
121 


5 CGCACCGCAGGTTGCTGAAC3 


QjvA B1 500-3 


122 


R'nTnATGTTTTCACTCCCCTGATTC3 , 




SixA.B2. GBL-5 


123 


5'CTTCACGAGGCAGACCTCAGCGCCTGATGAGTTCCAAATTATGC3' 




SixA.A2.GBL-3 


124 


5'&ATTTTSAGACACAACST(5GCTTTCATATTGCACCSCTTTTSTTAACCAG3' 




SixA.ext-5 






SixA.ext-3 


125 


5'GCTGATTGGCACACAAGGGC3' 




126 


5'CATTGATTCAGTCAATAGCCAATG3' . 



62 



f 



sodC \ 

< 
< 
i 


5odC.A1 .500-5 1 
DOdC.BI .500-3 
3odC.B2.GBL-5 ' 
3odC.A2.GBL-3 
3odC.ext-5 
3odC.ext-3 


27 f 
Izo J 

129 ; 

130 i 

131 ! 

132 ! 


>'GCAATCACGTCTGCCGTTTACC3* 
> GATCGGATGCTCGTAAAAGCCo 

j'CTTCACGAGGCAGACCTCAGCGCCCCGATCAACCTAAACCGCTGGG3' 
3'GATTTTGAGACACAACGTGGCTTTCATAGGACCTCCGTTCATTG3' 

5'CGTTCAAACATCTGCATCAGAG3' 

5'GGCGTCGCGTTGGCGTGGTTAG3' _ _ 


\spy 


Spy.AI .500-5 

Spy.BI. 500-3 

Spy.B2.GBL-5 

Spy.A2.GBL-3 

Spy.ext-5 

Spy.ext-3 


133 
134 
135 
136 

137 
138 


5'GACACGCTGAATTTTATGCC3' 
5 CTGCCCTGCCGTCAGTTTCG3 

S'CTTCACGAGGCAGACCTCAGCGCCTAATCTTTCAGCCAAAAAACTTAAGACS' 
5' GA TTTTGA&ACACA/\CGTGGCTTTCCATATTCTATATCCTTC(:TTTC3' 

5'GTCGGTATCGTGAGAACACC3' 

S'CTTACAGACATCCAGGCGTGS' _ . - ~ 


sucA 


SucA.M .500-5 

SucA.B1.500-3 

SucA.B2.GBL-5 

SucA.A2.GBL-3 

SucA.ext-5 

SucA.ext-3 


139 
140 
141 
142 

143 
144 


5'GGCTTGTTAGCGGCATATCG3' 
5 GACACG I 1 l l l CACTACGTG3 

5'CTTCACGAGGCAGACCTCAGCGCCTAAATAAAGGATACACAATG3' 
57SATTTTGAGACACAACGTGGCTTTCATCGTGATCCCTTAAGCATC3' 

5'CGCGAGCATTTACAGATGCC3' 

5'GCTTCACCGTACTGCTTACG3' _ _ . _ 


sulA 


SuiA.AI .500-5 

SulA.B1.500-3 

Su!A.B2.GBL-5 

SulA.A2.GBL-3 

SulA.ext-5 

SulA.ext-3 


145 
146 
147 
148 

149 
150 


5'C AGCTTCAGTTG ATTTCG CC3' 
5'CAGTTGG 1 1 1 1 CATGGGTCG3 

5' CTTCACGAGGCAGACCTCAGCGCCTAAGTAAATTTAGGATTAATCCTG3 
3GATTTTGAGACACAACGTGGCTTTCCATAATCAATCCAGCCCCT63' 

5' GCAAATCTTTCAGTCTTTCC3' 

5' CATTTCAAAGCCAACATACG3' _ - - 


\tatE 


TatE.A1.500-5 

TatE.BI .500-3 

TatE.B2.GBL-5 

-TatE.A2.GBL-3 

TatE.ext-5 

TatE.ext-3 


151 
152 
153 
154 

155 
156 


5'GTCTGATGACCTGTTATGAC3' 
5'CAACGCCACCAGATGTGTTC3' 

5'CTTCACGAGGCAGACCTCAGCGCCTGACGTGGCGAGCAGGACGC3 
5'GATTTTGAGACACAACGTGGCTTTCATAGATACCTTCTTGAC3' 

5TGATGCTGGTAATGAAATCG3' 

5'CGCGGTCGTATGGATCGTGC3' _ _ _ _ 


\ybeD 


YbeD.AI .500-5 

YbeD.BI. 500-3 

YbeD.B2.GBL-5 

YbeD.A2.GBL-3 

YbeD.ext-5 

YbeD.ext-3 


157 
158 
159 
160 

161 
162 


5TACTTTTAAAGGCCGTGAAG3' 

5'GCCCGAGGATGCGCTTCTAT3' 

5-CTTCACGAGGCAGACCTCAGCGCCTAACTCGCTTCTCCGTTAC3 

VGATTTTGAGACACAACGTGGCTTTCATGTCAGCTCCGGCGTAACy 
5'CGGACACACTGACAAAGCAG3' 

5'CCATATTGACGTTTAATGCC3' _ - _ - 


\ybjF 


YbjF.AI .500-5 

YbjF.BI .500-3 

YbjF.B2.GBL-5 

YbjF.A2.GBL-3 

YbjF.ext-5 

YbjF.ext-3 


163 
164 
165 
166 

167 
168 


5TCATGGAAGACGAAACGTTG3' 
5 CGG AAGTGAAAACTGTCTCT3 

5'CTTCACGAGGCAGACCTCAGCGCCTAAAAAAGCCGCATGTG3 
5'GATTTTGAGACACAACGTGGCTTTCATACATTGACCTTCACATC3' 

5'CAACCTGGCTACATAATGCC3' 

5'GATACCTACAAAACGTTTGC3' . _. - - 


\yccA 


YccA.AI .500-5 

YccA.BI .500-3 

YccA.B2.GBL-5 

YccA.A2.GBL-3 

YccA.ext-5 

YccA.ext-3 


169 
170 
171 
172 

173 
174 


5'CGGGCGGTGGGGATGTTTAG3' 
5'CAGTGGTTAAAGAGTGGCGG3' 

5'CTTCACGAGGCAGACCTCAGCGCCTAATCTCACCCGCTAACAC3' 
5'GATTTTGAGACACAACGTGGCTTTCATTGAGTCACTCTCTATG3 

5'CTGCACTGGCGCACGTCGCC3' 
5'CGATGGCAGCGTGGAAGTGG3' 
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yceP 



ycfJ 



ycjL 



ycfR 



ydcl 



yebE 



yfcx 



YceP .A1 .500-5 

YceP.BI .500-3 

YceP.B2.GBL-5 

YceP.A2.GBL-3 

YceP.ext-5 

YceP.ext-3 



YcfJ.AI .500-5 

YcfJ.BI .500-3 

YcfJ.B2.GBL-5 

YcfJ.A2.GBL-3 

YcfJ.ext-5 

YcfJ.ext-3 



yggN 



YcfL.A1 .500-5 

YcfL.BI .500-3 

YcfL.B2.GBL-5 

YcfL.A2.GBL-3 

YcfL.ext-5 

YcfL.ext-3 



YcfR.AI .500-5 

YcfR.B1.500-3 

YcfR.B2.GBL-5 

YcfR.A2.GBL-3 

YcfR.ext-5 

YcfR.ext-3 



175 
176 
177 
178 



5'GCGAAAACTTCTCCATTGCC3' 
5'CAGCGGGCCATAATCCCTTG3' 

5'CTTCACGAGGCAGACCTCAGCGCCTAACATGACATGACCATCC3' 
5'GATTTTGAGACACAACGTGGCTTTCATCATGGCCCCCTAATTCG3' 



179 5'CCAGTATATTCAACAGGGGG3' 



180 



Ydcl.AI .500-5 

Ydcl.B1.500-3 

Ydcl.B2.GBL-5 

Ydcl.A2.GBL-3 

Ydcl.ext-5 

Ydcl.ext-3 



YebE.AI .500-5 

YebE.BI .500-3 

YebE.B2.GBL-5 

YebE.A2.GBL-3 

YebE.ext-5 

YebE.ext-3 



YfcX.AI .500-5 

YfcX.BI .500-3 

YfcX.B2.GBL-5 

YfcX.A2.GBL-3 

YfcX.ext-5 

YfcX.ext-3 



181 
182 
183 
184 

185 
186 



5'CTTCGCCAGTTG GATCCAGG3 ' 



187 
188 
189 
190 

191 
192 



5'CAGGCTGCACACCAGATGGC3' 
5'CGGAATTTACCAACAAAGAG3' 

5'CTTCACGAGGCAGACCTCAGCGCCTAACAAGGCTGTACTCTG3' 
5'GATTTTGAGACACAACGTGGCTTTCACGGGAACACCTCCTTC3' 

S'CAGACATTTACGCTATTGGCS' 

S-GGACCTCGTCGAAG CGACCG3' 



193 
194 
195 
196 

197 
198 



5'GATATATACGGCAGCAAAAC3' 
5'GGCAATGCCTATGGCTTTAC3' 

5'CTTCACGAGGCAGACCTCAGCGCCTAAGGGGTGAATCTTGATG3' 
5'GATTTTGAGACACAACGTGGCTTTCATCGTTACAGACCTTTATG3' 

5'GCGATTATATTTAGTGTGCG3' 

5'CTGACCAGATAAT TTCGCC C3' 



199 
200 
201 
202 

203 
204 



5'CAGCTGTGCTTCATGCTTAG3' 
5'GCCGGCTGGACTGGATAACC3" 

5'CTTCACGAGGCAGACCTCAGCGCCTAAGCATTAACCCTCATT3' 
5'GATTTTGAGACACAACGTGGCTTTCATAATAGTGGCCTTATGC3' 

5'CATGAAGCAGCCTGCCGGGG3' 

S'GACAAACGTGCA^CCCAACS' _ 



205 
206 
207 
208 

209 
210 



S'GTCGAATGTACCGGCACCCCS' 

5'CATCAACAGTATTGCTTTCC3' 

5-CTTCACGAGGCAGACCTCAGCGCCTGAAAGGTGAAGGGATCTGTC3 

5'GATTTTG AG AC AC A ACGTGGCTTTC AT A AGCGAT&TT AAA A AC3' 
5'GCGTGTCGTATTCTTCTTGC3' 

5'CGCTTCATCTCACTGAGGAC3: _ 



YggN.AI .500-5 

YggN.BI .500-3 

YggN.B2.GBL-5 

YggN.A2.GBL-3 

YggN.ext-5 

YggN.ext-3 



211 
212 
213 
214 

215 
216 



5'CAAAAAATTGTCGGTCAGGC3' 

5'GCATATTCACAGCCTGGTTC3' 

5'CTTCACGAGGCAGACCTCAGCGCCTAATTCCGCTCTCTGGATAG3 

5-GATTTTGAGACACAACGTGGCTTTCATATTTGCTCCTCAATAAC3' 
5'GTGAAGATCTGGATGCTGCC3' 

S'GGTGTTATCGGGCGTAATCGS' _ _ — 



217 
218 
219 
220 

221 
222 



5'CGCAAACACGGAACGGTAAC3' 

5'GAG ATCACCAGTACCGAAGC3" _„-™-^„ A o. 
5'CTTCACGAGGCAGACCTCAGCGCCTAAGAAGGTCAAAGCTATATGAA3 

5'GATTTTGAGACACAACGTGGCTTTCATTATTCCGCCTCCAGAACCA3' 
5'G GTG ATG ACTG CCTTTATCC3' 

S'CATCTTCAGATT^CACGGGCS'^. . 



5'GAACCGTAGCCGTCGTCTGC3' 

5'CATCGTGTCGGTACCGTGGG3' 

5'CTTCACGAGGCAGACCTCAGCGCCTAATCCTCTATTTTAAGACG3' 

5'GATTTTGAGACACAACGTGGCTTTCATAGTCTTCCCTCAAG3' 

5'GTGATGTCTTCTATTGACGG3' 
5'GTTGGCGGAGGCTTTATCAG3' 
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I* * 

/ 



yghO 


YghO.AI .500-5 : 

Ygnu.Di .ouu-o 

YghO.B2,GBL-5 

YghO.A2.GBL-3 

YghO.ext-5 

YghO.ext-3 


223 

225 
226 

227 
228 


5'CGACCAAGGTGCCTTGAGTC3' 
5 GCAGCCGCGAACGO 1 G 1 AUW 

5'CTTCACGAGGCAGACCTCAGCGCCTAATACCAGCTAACTCAGGTTC3' 
SGATTTTGAGACACAACGTGGCTTTATTAAGGAAGGTGCGAACAAGTCy 

5'CTGCTCTTTGTTCTTGGTCG3' 

5'GCGCAGGGTCGCGATTCTCG3* _ _ . _ 


ygiB 


YgiB.AI .500-5 

YgiB.B1.500-3 

YgiB.B2.GBL-5 

YgiB.A2.GBL-3 

YgiB.ext-5 

YgiB.ext-3 


ion 
229 

230 

231 

232 

233 

234 


«; • r* a t a a f* a P T A n T n ^ ' 
5'GTTCACGCAGCTCAACGAAG3' 

5 , CTTCACGAGGCAGACCTCAGCGCCTGATACCGATGGAAAGAGTC3 , 
S'GATTTTGAGACACAACGTGGCTTTCATTTTTGTCTTCCGGGACCS ' 
5'GAATGGTTAACTCGCAGGTG3' 

5'CCTGATCCTGTAAATCCGTG3' _ 


yhhY 


YhhY.AI .500-5 

YhhY.BI .500-3 

YhhY.B2.GBL-5 

YhhY.A2.GBL-3 

YhhY.ext-5 

YhhY.ext-3 


235 

237 
238 

239 
240 


5'CGCTGGTGAAATGGATATGG3' 
5 GATAAAAAAGCGCCTCTTAUo 

5'CTTCACGAGGCAGACCTCAGCGCCTAAGATAGTGCCC I I I I ICTG3' 
5'GATTTTGAGACACAACGTGGCTTTCATTCCTTTGTCCTCTTTGG3' 

S'GTTTCGCGTACTCGAAATGGS' 

S'CGATAAGATGTTGACAGAGGS' _ _ 


yiaH 


YiaH.AI .500-5 

YiaH. B1. 500-3 

YiaH.B2.GBL-5 

YiaH.A2.GBL-3 

YiaH.ext-5 

YiaH.ext-3 


241 
242 
243 
244 

245 
246 


5'GGAAAAAGCAGGGCTTAACG3' 
5 GTCAAATGCGTTTGTTTCGC3 

5'CTTCACGAGGCAGACCTCAGCGCCTAAGTAAAAGCCCGGTCACATTGGAC3' 
5'GATTTTGAGACACAAC(5TG6CTTTCATCTGTGTCTCTeTATCTG3' 

5'CAAGCCCTGGAAGGTCCTGG3' 

5'CATATCTGCCAGTTAGTTGC3' _ .. . _ 


yjbO 


YjbO.AI .500-5 

YjbO. B1 .500-3 

YjbO.B2.GBL-5 

YjbO.A2.GBL-3 

YjbO.ext-5 

YjbO.ext-3 


247 
248 
249 
250 

251 
252 


5'CGATTAACGGTGGTATCAAG3' 
5'CCGTGGGCAGAGACACCTGG3' 

5 , CTTCACGAGGCAGACCTCAGCGCCTAAGGGATTGTGCGGATGATCACAAC3' 
5'GATTTTGAGACACAACGTGGCTTTCATGATGCTCTCCCAAATATG3' 

5'GCAAAGGCGAGTGTGAGATG3' 

5'GAGCGGTTAAAAGAGATCAC3' _ ._ . . 


yrieA 


YneA.AI .500-5 

YneA.BI .500-3 

YneA.B2.GBL-5 

YneA.A2.GBL-3 

YneA.ext-5 

YneA.ext-3 


253 
254 
255 
256 

257 
258 


5'GGCTGCATAAAACCCATGCC3' 
5 CGACTGATGTTCATATTCGC3 

5'CTTCACGAGGCAGACCTCAGCGCCTGATGTGCATTACTTAACCG3' 
5-GATTTTGAGACACAACGTGGCTTTCATGAAGATATCCTTTATGG3- 

5'GCTAACCTGGATGTGCTGGG3' 
J 5'GGTACCGGACATCCGGCAAC3' _•_ _ _ 


yoaB 


YoaB.AI .500-5 

YoaB.BI .500-3 

YoaB.B2.GBL-5 

YoaB.A2.GBL-3 

YoaB.ext-5 

YoaB.ext-3 


259 
260 
261 
262 

263 
264 


S'CCGGCAGATCGCCCCCCGCCS' 
5 GGTGTTGGCGCTGATACATC3 

5 , CTTCACGAGGCAGACCTCAGCGCCTAAGCTTTATCGAAGCAAAATAAG3 , 
5'GATTTTGAGACACAACGTGGCTTTCATCATTTTGTCCTCATTATAC3' 

5'CCACGCCTGTGAATCTTCCG3' 

5'CCAGGGTTCCAGCCTTCCTG3' 


yqcC 


YqcC.AI .500-5 

YnrC R1 500-3 

YqcC.B2.GBL-5 

YqcC.A2.GBL-3 

YqcC.ext-5 

YqcC.ext-3 


265 
266 
267 
268 

269 
270 


5'CTGTAAGCGCCTTGTAAGAC3' 
5'CGAAGCTGATGTTTGCGTCC3 , 

5'CTTCACGAGGCAGACCTCAGCGCCTAATGCTGGAAATACTCTATC3' 
5' GATTTTG AG AC AC A ACGTGGCTTTC AT A A AGC A ACCTC A AT A AG3' 

5*CTTAAGCCTCTTCTGTAATC3' 
S'GGCCCGCGTGAATAGTCAGCS* 
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,- 1 - • 



yqeC 


YqeG.Ai .ouU-o 

YqeC. B1. 500-3 

YqeC.B2.GBL-5 

YqeC.A2.GBL-3 

YqeC.ext-5 

YqeC.ext-3 


272 
273 
274 
275 
276 


«VGGGGATGCCATTATGGAGTG3' 
5'CACCAAACGACTCAGCATGG3' 

5'CTTCACGAGGCAGACCTCAGCGCCTAGCGGCCCGGGTATTCCGGG3' 
5 ' G ATTTTG AG ACACAACGTGGCTTTCACG AGTCTTTATG ACCTC3 ' 
5'CTGCAI 1 1 1 CTATTTCGACG3' 

5'GAACCTTGCGACGACTTGCC3' _ _ .... „ 




ATT A1 500-5 
ATT.B1. 500-3 
ATT.B2.xfp-5 
ATT.A2.xfp-3 
ATT-ext5 
I ATT-ext3 


277 
278 
279 
280 
281 
282 


5'CGATGGCGATAATATTTCACC3' 
5*CCCTGATACTCACCAGGCATC3' 

5'TGAGTAGGACAAATCCGCCGCTAAAAAAGCAGGCTTCAAC3' 
5'GCGTTTTTTATTGGTGAGAATTACTAACTTGAGCGAAACG3' 
5'GGCGATAAATTGCCGCATCG3' 
5TGCCACCATCAAGGGAAAGCCC3' 



Table 7. Primers used for the Q-RT-PCR experiments. 

5 Primers were designed to amplify about 200-bp internal gene sequence. 



Target 
genes 


Primers 
name 


SEQ ID NO: j 

| Primers sequence 

l 
1 


cpxP 


cpxP-RT-5 
cpxP-RT-3 


283 ! 5' CGCTGGCAGTCAGTTCATTAAGCC 3' 

284 j 5 . GTCTCCAGTTCGCTAACATTAAC 3' 


cyoD 


cyoD-RT-5 
cyoD-RT-3 


285" ! 5' CTACCGATCACAGCGGCGCGTCCC 3' 
286 j 5 - GTTCCAGCCTTCATCTGATTTGG 3' 


jimA 


fimA-RT-5 
fimA-RT-3 


287 1 5' CTGGCAATCGTTGTTCTGTCGGCTC 3' 

288 j 5 - GCTCCTTCCTGTGCCAGCGATGCG 3' 


sucA 


sucA-RT-5 
sucA-RT-3 


289 i 5' GAACAGCTCTATGAAGACTTCTTAAC 3' 

290 | 5 ' GCTGCAGGACTTTAACCTGCTTCACAT 3' 


ycfJ 


ycfJ-RT-5 
ycfJ-RT-3 


29 1 | 5" GTTG GCGGGT ATCGGG ATTGGTGTC 3' 

292 ! 5' GTAATGCGA Till CATCCTGCACC 3' 


yc/R 


ycfR-RT-5 
ycfR-RT-3 


293 " j 5* CCCTCATCGCTGCGGCGATTTTAAGC 3' 

294 | 5' CCGGTT AC AG A AGT A AT ACGG A A AG 3' 


yebE 


yebE-RT-5 
yebE-RT-3 


295 i 5' GGCTGCTGGTCGCAAATAAATCAG 3* 

296 • 5' GCAAGGATCAAACGTGCTGTACGC 3' 



